and LIEVEN STUYVER 
For: NEW SEQUENCES OF HEPATITIS C § 
VIRUS GENOTYPES AND THEIR USE AS § 
PROPHYLACTIC, THERAPEUTIC AND § 
DIAGNOSTIC AGENTS § 
PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 



Washington, D.C. 20231 



Sir: 



EXPRESS MAIL MAILING LABEL 

NUMBER EM219702805US 
DATE OF DEPOSIT April 21, 1997 

I hereby certify that this paper or fee is being deposited with the 
United States Postal Service "EXPRESS MAIL POST OFFICE TO 
ADDRESSEE" service under 37 C.F.R. 1.10 on the date indicated 
above and is addressed to: Assistant Commissioner for Patents, 
Washington D.C. 20231. 




Signature 



Preliminary to examining the above referenced application, please amend the application as 
follows: 

IN THE CLAIMS: 

Please cancel claims 2, 3, 8, 20, 38, 40, 51, 53, 59, and 62. 

Please amend claims 1, 4-7, 9, 10, 13-15, 21-28, 30-36, 39, 41, 46-48, 54, and 61 as 
follows: 

1. An HCV polynucleic acid, having a nucleotide sequence which is [unique to a theretofore 
unidentified HCV type or subtype which is] different from HCV subtypes 1a, 1b, 1c, 1d, 1e, 1f, 
1g, 2a, 2b, 2c, 2d, 2e, 2f, 2g, 2h, 2i, 2k, 2l, 3a, 3b, 3c, 3d, 3e, 3f, 3g, 4a, 4b, 4c, 4d, 4e, 4f, 4g, 4h, 
4i, 4j, 4k, 4l, 4m, 5a, [or] 6a, 7a, 7c, 7d, 9, 10, or 11, [with said HCV subtypes being classified as 
in Table 3] by comparison of a part of the NS5 gene nucleotide sequence spanning positions 
7932 to 8271, [with said amino acid numbering being shown in Table 1] and with said 
polynucleic acid containing at least one nucleotide differing from said known HCV nucleotide 
sequences, or the complement thereof. 

4. The [A] polynucleic acid according to [any of] claim[s] 1 [to 3] encoding an HCV 
polyprotein comprising in its amino acid sequence at least one of the following amino acid 
residues: 

I15, C38, V44, A49, Q43, P49, Q55, A58, S60 or D60, E68 or V68, H70, A71 or Q71 or N71, 
D72, H81, H101, D106, S110, L130, I134, E135, L140, S148, T150 or E150, Q153, F155, D157, 
G160, E165, I169, F181, L186, T190, T192 or I192 or H192, I193, A195, S196, R197 or N197 
or K197, Q199 or D199 or H199 or N199, F200 or T200, A208, I213, M216 or S216, N217 or 
S217 or G217 or K217, T218, I219, A222, Y223, I230, W231 or L231, S232 or H232 or A232, 
Q233, E235 or L235, F236 or T236, F237, L240 or M240, A242, N244, N249, I250 or K250 or 
R250, A252 or C252, A254, I255 or V255, D256 or M256, E257, E260 or K260, R261, V268, 
S272 or R272, I285, G290 or F290, A291, A293 or W293, T294 or A294, S295 or H295, K296 
or E296, Y297 or M297, I299 or Y299, I300, S301, P316, S2646, A2648, G2649, A2650, 
V2652, Q2653, H2656 or L2656, D2657, F2659, K2663 or Q2663, A2667 or V2667, D2677, 
L2681, M2686 or Q2686 or E2686, A2692 or K2692, H2697, I2707, L2708 or Y2708, A2709, 
A2719 or M2719, F2727, T2728 or D2728, E2729, F2730 or Y2730, I2741, I2745, V2746 or 
E2746 or L2746 or K2746, A2748, S2749 or P2749, R2750, E2751, D2752 or N2752 or S2752 
or T2752 or V2752 or I2752 or Q2752, S2753 or D2753 or G2753, D2754, A2755, L2756 or 
Q2756, R2757, 

with said notation being composed of a letter representing the amino acid residue by its one-letter 
code, and a number representing the amino acid numbering as shown in Table 1, or a part of said 
polynucleic acid which is unique to at least one of the HCV subtypes or types as defined in 
claim[s 2 to 3] 1, and which contains at least one nucleotide differing from known HCV 
nucleotide sequences, or the complement thereof. 

5. The [A] polynucleic acid according to [any of] claim[s] 1 [to 4], with said polynucleic 
acid encoding a HCV polyprotein comprising in its amino acid sequence at least one amino acid 
sequence chosen from the following list: 

ARQSDGRSWAQ or ARRSEGRSWAQ as for subtype 1d (SEQ ID NO 107 and 108) 
ERRPEGRSWAQ as for subtype 1e (SEQ ID NO 109) 
ARRPEGRSWAQ as for subtype 1f (SEQ ID NO 110) 
DRRTTGKSWGR as for subtype 2k (SEQ ID NO 111) 
DRRATGRSWGR as for subtype 2e (SEQ ID NO 112) 
DRRATGKSWGR as for subtype 2f (SEQ ID NO 113) 
VRQPTGRSWGQ as for type 9 (SEQ ID NO 114) 
VRHQTGRTWAQ as for subtype 7a and 7c (SEQ ID NO 115) 
VRQNQGRTWAQ as for subtype 7d (SEQ ID NO 116) 
ARRTEGRSWAQ as for type 10 (SEQ ID NO 117) 
VRRTTGRXXXX or VRRTTGRTWAQ as for type 11 (SEQ ID NO 118 and 119) 
HEVRNASGVYHVA or HEVRNASGVYHL as for subtype 1d (SEQ ID NO 120 and 121) 
YEVHSTTDGYHV as for subtype 1f (SEQ ID NO 122) 
VEVKNTSQAYMA as for subtype 2e (SEQ ID NO 123) 
IQVKNNSHFYMA as for subtype 2f (SEQ ID NO 124) 
VQVKNTSTMYMA as for subtype 2g (SEQ ID NO 126) 
VQVANRSGSYMV as for subtype 2i (SEQ ID NO 127) 
VEIKNTXNTYVL or VEIKNTSNTYVL as for subtype 2k (SEQ ID NO 128 and 129) 
INYRNVSGIYYV or INYRNTSGIYHV or INYHNTSGIYHI or TYYRNVSGIYHV as for subtype 4k (SEQ ID NO 130, 131, 132 or 133) 
QHYRNVSGIYHV as for subtype 4l (SEQ ID NO 134) 
IQVKNASGIYHL as for type 9 (SEQ ID NO 135) 
AHYTNKSGLYHL as for subtype 7c (SEQ ID NO 136) 
LNYANKSGLYHL as for subtype 7d (SEQ ID NO 137) 
LEYRNASGLYMV as for type 10 (SEQ ID NO 138) 
IYEMDGMIHY or IYEMSGMILHA as for subtype 1d (SEQ ID NO 139 and 140) 
VYEAKDIILHT as for subtype 1f (SEQ ID NO 141) 
VWQLXDAVLHV as for subtype 2e (SEQ ID NO 142) 
VWQLRDAVLHV as for subtype 2f (SEQ ID NO 143) 
IWQMQGAVLHV as for subtype 2g (SEQ ID NO 144) 
VWQLKDAVLHV as for subtype 2h (SEQ ID NO 145) 
VWQLEEAVLHV as for subtype 2i (SEQ ID NO 146) 
TWQLXXAVLHV as for subtype 2k (SEQ ID NO 147) 
VYEADHHILHL or VYEADHHILAL or VFEADHHILHL as for subtype 4k (SEQ ID NO 148, 149 and 150) 
VYESDHHILHL as for subtype 4l (SEQ ID NO 151) 
VFEAETMILHL as for type 9 (SEQ ID NO 152) 
VYEAETLILHL as for subtype 7c (SEQ ID NO 153) 
VYEANGMILHL as for subtype 7d (SEQ ID NO 154) 
VYEAGDIILHL as for type 10 (SEQ ID NO 155) 
VREDNHLRCWMAL or VRENNSSRCWMAL as for subtype 1d (SEQ ID NO 156 and 157) 
IREGNISRCWVPL as for subtype 1f (SEQ ID NO 158) 
ENSSGRFHCWIPI as for subtype 2e (SEQ ID NO 159) 
ERSGNRTFCWTAV as for subtype 2f (SEQ ID NO 160) 
ELQGNKSRCWIPV as for subtype 2g (SEQ ID NO 162) 
ERHQNQSRCWIPV as for subtype 2h (SEQ ID NO 163) 
EWKDNTSRCWIPV as for subtype 2i (SEQ ID NO 164) 
EREGNSSRCWIPV as for subtype 2k (SEQ ID NO 165) 
VREGNQSRCWVAL or VRTGNQSRCWVAL or VRVGNQSSCWVAL or VRVGNQSRCWVAL or VKEGNKSRCWVAL as for subtype 4k (SEQ ID NO 166, 167, 168 or 169) 
VKTGNTSRCWVAL as for subtype 4l (SEQ ID NO 170) 
IKAGNESRCWLPV as for type 9 (SEQ ID NO 171) 
VKEGNQSRCWVQA as for subtype 7c (SEQ ID NO 172) 
VKXXNLTKCWLSA as for subtype 7d (SEQ ID NO 173) 
VRSGNTSRCWIPV as for type 10 (SEQ ID NO 174) 
VKNASVPTAA or VKDANVPTAA as for subtype 1d (SEQ ID NO 175 and 176) 
ARIANAPIDE as for subtype 1f (SEQ ID NO 177) 
VSKPGALTKG as for subtype 2e (SEQ ID NO 178) 
VSRPGALTRG as for subtype 2f (SEQ ID NO 179) 
VNQPGALTRG as for subtype 2g (SEQ ID NO 180) 
VSQPGALTRG as for subtype 2h (SEQ ID NO 181) 
VSQPGALTKG as for subtype 2i (SEQ ID NO 182) 
VSRPGALTEG as for subtype 2k (SEQ ID NO 183) 
APYIGAPLES or APYTAAPLES as for subtype 4k (SEQ ID NO 184 and 185) 
APILSAPLMS as for subtype 4l (SEQ ID NO 186) 
VPNSSVPIHG as for type 9 (SEQ ID NO 187) 
VPNASTPVTG as for subtype 7c (SEQ ID NO 188) 
VQNASVSIRG as for subtype 7d (SEQ ID NO 189) 
VKSPCAATAS as for type 10 (SEQ ID NO 190) 
SPRMHHTTQE or SPRYLYHTTQE as for subtype 1d (SEQ ID NO 191 and 192) 
TSRRHWTVQD as for subtype 1f (SEQ ID NO 193) 
APKRHYFVQE as for subtype 2e (SEQ ID NO 194) 
SPQYHTFVQE as for subtype 2f (SEQ ID NO 195) 
SPQHHNFSQD as for subtype 2g (SEQ ID NO 196) 
SPQHHIFVQD as for subtype 2h (SEQ ID NO 197) 
SPEHHHFVQD as for subtype 2k (SEQ ID NO 198) 
RPRRHWTTQD or RPRRHWTAQD or QPRRHWTTQD or RPRRHWTTQE as for subtype 4k (SEQ ID NO 199, 200, 201 or 202) 
QPRRHWTVQD as for subtype 4l (SEQ ID NO 203) 
RPKYHQVTQD as for type 9 (SEQ ID NO 204) 
RPRMHQVVQE as for subtype 7c (SEQ ID NO 205) 
RPRMYEIAQD as for subtype 7d (SEQ ID NO 206) 
RHRQHWTVQD as for type 10 (SEQ ID NO 207) 

or a part of said polynucleic acid which is unique to at least one of the HCV subtypes or types as 
defined in claim[s 2 to 3] 1, and which contains at least one nucleotide differing from known 
HCV nucleotide sequences, or the complement thereof. 

6. The [A] polynucleic acid according to [any of] claim[s] 1 [to 5] having a sequence 
selected from any of SEQ ID NO 1 to 105, or a part of said polynucleic acid which is unique to at 
least one of the HCV subtypes or types as defined in claim[s 2 to 3] 1, and which contains at 
least one nucleotide differing from known HCV nucleotide sequences, or the complement 
thereof. 

7. The [A] polynucleic acid according to [any of] claim[s] 1 [to 6], which codes for the 5' 
UR, the Core/E1, the NS4 or the NS5B region, [or] a part thereof, or the complement thereof. 

9. An oligonucleotide primer comprising part of a polynucleic acid according to any of 
claims 1, 4, 5, 6 or 7 [to 8], with said primer being able to act as primer for specifically 
amplifying the nucleic acid of a certain isolate belonging to the genotype from which the primer 
is derived. 

10. An oligonucleotide probe comprising part of a polynucleic acid according to any of 
claims 1, 4, 5, 6 or 7 [to 8], with said probe being able to act as a hybridization probe for specific 
detection and/or classification into types and/or subtypes of a HCV nucleic acid containing said 
nucleotide sequence, with said probe being possibly labeled or attached to a solid substrate. 

13. The [A] diagnostic kit according to claim 12, wherein said probe(s) is(are) attached to a 
solid substrate. 

14. The [A] diagnostic kit according to claim 13, wherein a range of said probes are attached 
to specific locations on a solid substrate. 

15. The [A] diagnostic kit according to claim 14, wherein said solid support is a membrane 
strip and said probes are coupled to the membrane in the form of parallel lines. 

21. The [A] method according to claims 16 to 18, wherein said nucleic acids are labeled 
during or after amplification. 

22. A polypeptide having an amino acid sequence encoded by a polynucleic acid according to 
[any of] claim[s] 1 [to 8], or a part thereof which is unique to at least one of the HCV subtypes or 
types as defined in claim[s 2 or 3] 1, and which contains at least one amino acid differing from 
any of the known HCV types or subtypes amino acid sequences, or an analog thereof being 
substantially homologous and biologically equivalent. 

23. The [A] polypeptide according to claim 22 comprising in its amino acid sequence at least 
one of the following amino acid residues: 

I15, C38, V44, A49, Q43, P49, Q55, A58, S60 or D60, E68 or V68, H70, A71 or Q71 or N71, 
D72, H81, H101, D106, S110, L130, I134, E135, L140, S148, T150 or E150, Q153, F155, D157, 
G160, E165, I169, F181, L186, T190, T192 or I192 or H192, I193, A195, S196, R197 or N197 
or K197, Q199 or D199 or H199 or N199, F200 or T200, A208, I213, M216 or S216, N217 or 
S217 or G217 or K217, T218, I219, A222, Y223, I230, W231 or L231, S232 or H232 or A232, 
Q233, E235 or L235, F236 or T236, F237, L240 or M240, A242, N244, N249, I250 or K250 or 
R250, A252 or C252, A254, I255 or V255, D256 or M256, E257, E260 or K260, R261, V268, 
S272 or R272, I285, G290 or F290, A291, A293 or W293, T294 or A294, S295 or H295, K296 
or E296, Y297 or M297, I299 or Y299, I300, S301, P316, S2646, A2648, G2649, A2650, 
V2652, Q2653, H2656 or L2656, D2657, F2659, K2663 or Q2663, A2667 or V2667, D2677, 
L2681, M2686 or Q2686 or E2686, A2692 or K2692, H2697, I2707, L2708 or Y2708, A2709, 
A2719 or M2719, F2727, T2728 or D2728, E2729, F2730 or Y2730, I2741, I2745, V2746 or 
E2746 or L2746 or K2746, A2748, S2749 or P2749, R2750, E2751, D2752 or N2752 or S2752 
or T2752 or V2752 or I2752 or Q2752, S2753 or D2753 or G2753, D2754, A2755, L2756 or 
Q2756, R2757, 

with said notation being composed of a letter representing the amino acid residue by its one-letter 
code, and a number representing the amino acid numbering as shown in Table 1, or a part of said 
polynucleic acid which is unique to at least one of the HCV subtypes or types as defined in 
claim[s 2 to 3] 1, and which contains at least one nucleotide differing from known HCV 
nucleotide sequences, or the complement thereof. 

24. The [A] polypeptide according to claim 22 comprising in its amino acid sequence at least 
one of the sequences represented by SEQ ID NO 107 to 207 as listed in claim 5, or part of said 
polypeptide which is unique to at least one of the HCV subtypes or types as defined in claim[s 2 
to 3] 1, and which contains at least one amino acid differing from known HCV types or subtypes 
amino acid sequences, or an analog thereof being substantially homologous and biologically 
equivalent to said polypeptide. 

25. The [A] polypeptide having an amino acid sequence as represented in any of SEQ ID NO 
1 TO 106, or a part thereof which is unique to at least one of the HCV subtypes or types as 
defined in claim[s 2 to 3] 1, and which contains at least one amino acid differing from known 
HCV types or subtypes amino acid sequences, or an analog thereof being substantially 
homologous and biologically equivalent to said polypeptide. 

26. The [A recombinant] polypeptide [encoded by a polynucleic acid] according to any of 
claims 1, 4, 5, 6 or 7 which is recombinantly produced [to 8, or a part thereof which is unique to 
at least one of the HCV subtypes or types as defined in claims 2 or 3, and which contains at least 
one amino acid differing from known HCV types or subtypes amino acid sequences, or an analog 
thereof being substantially homologous and biologically equivalent to said polypeptide]. 

27. A method for production of a recombinant polypeptide of claim 26, comprising: 
transformation of an appropriate cellular host with a recombinant vector, in which a 
polynucleic acid or a part thereof according to any of claims 1, 4, 5, 6, or 7 [to 8] has 
been inserted under the control of the appropriate regulatory elements, 

culturing said transformed cellular host under conditions enabling the expression of said 
insert, and, 
harvesting said polypeptide. 

28. A recombinant expression vector comprising a polynucleic acid or a part thereof 
according to any of claims 1, 4, 5, 6 or 7 [to 8] operably linked to prokaryotic, eukaryotic, or 
viral transcription and translation control elements. 

30. A method for detecting antibodies to HCV present in a biological sample, comprising: 

a) contacting the biological sample to be analyzed for the presence of HCV with a 

polypeptide according to any of claims 22 to [26] 25, 

b) detecting the immunological complex formed between said antibodies and said 

polypeptide. 

31. A method for HCV typing, comprising: 

a) contacting the biological sample to be analyzed for the presence of HCV with a 

polypeptide according to any of claims 22 to [26] 25, 

b) detecting the immunological complex formed between said antibodies and said 

polypeptide. 

32. A diagnostic kit for use in detecting the presence of HCV, said kit comprising at least one 
polypeptide according to any of claims 22 to [26] 25, with said polypeptide being possibly bound 
to a solid support. 

33. A diagnostic kit for HCV typing, said kit comprising at least one polypeptide according 
to any of claims 22 to [26] 25, with said polypeptide being possibly bound to a solid support. 

34. The [A] diagnostic kit according to claims 32 [to] or 33, said kit comprising a range of 
polypeptides which are attached to specific locations on a solid substrate. 

35. The [A] diagnostic kit according to claim[s 32 to] 34, wherein said solid support is a 
membrane strip and said polypeptides are coupled to the membrane in the form of parallel lines. 

36. A pharmaceutical composition comprising at least one polypeptide according to any of 
claims 22 to [26] 25 and a suitable excipient, diluent or carrier. 

39. A vaccine for immunizing a mammal against HCV infection, comprising at least one 
polypeptide according to claims 22 to [26] 25, in a pharmaceutically acceptable carrier. 

41. A peptide corresponding to an amino acid sequence encoded by at least one of the HCV 
polynucleic acids according to any of claims 1, 4, 5, 6 or 7 [to 8], with said peptide comprising 
an epitope being unique to at least one of the HCV subtypes or types as defined in claim[s 2 or 3] 
1, and with said peptide containing at least one amino acid differing from any of the known HCV 
types or subtypes amino acid sequences, or an analog thereof being substantially homologous 
and biologically equivalent. 

46. The [A] diagnostic kit according to claims 44 or 45, wherein said peptides are selected 
from the following list: 

at least one NS4 peptide, 

at least one NS4 peptide and at least one Core peptide, 

at least one NS4 peptide and at least one Core peptide and at least one E1 peptide or, 
at least one NS4 peptide and at least one E1 peptide. 

47. The [A] diagnostic kit according to claims 44 [to 46] or 45, said kit comprising a range of 
peptides which are attached to specific locations on a solid substrate. 

48. The [A] diagnostic kit according to claims 44 [to 47] or 45, wherein said solid support is 
a membrane strip and said peptides are coupled to the membrane in the form of parallel lines. 

54. An antibody raised upon immunization with at least one polypeptide or peptide according 
to any of claims 22 to [26] 25 or 41, with said antibody being specifically reactive with any of 
said polypeptides or peptides, and with said antibody being preferably a monoclonal antibody. 

61. A method of preventing or treating HCV infection, comprising administering the 
pharmaceutical composition of claim [62] 60 to a mammal in effective amount. 



The claims from the PCT application have been amended to conform to U.S. practice. 
Claims 1, 4-7, 9-19, 21-37, 39, 41-50, 52, 54-58, and 60-61 are now pending; and allowance of all 
claims is requested. 

NEW SEQUENCES OF HEPATITIS C VIRUS GENOTYPES AND THEIR USE AS 
PROPHYLACTIC, THERAPEUTIC AND DIAGNOSTIC AGENTS 



The invention relates to new sequences of hepatitis C virus (HCV) genotypes 
and their use as prophylactic, therapeutic and diagnostic agents. 

The present invention relates to new genomic nucleotide sequences and amino 
acid sequences corresponding to the coding region of these genomes. The invention 
relates to new HCV type and subtype sequences which are different from the 
known HCV type and subtype sequences. More particularly, the present invention 
relates to new HCV type 7 sequences, new HCV type 9 sequences, new HCV type 
10 sequences and new HCV type 11 sequences. The present invention also relates to new HCV 
type 1 sequences of subtypes 1d, 1e, 1f and 1g; new HCV type 2 sequences of 
subtypes 2e, 2f, 2g, 2h, 2i, 2k and 2l; new HCV type 3 sequences of subtype 3g; 
new HCV type 4 sequences of subtypes 4k, 4l and 4m; a process for preparing 
them, and their use for diagnosis, prophylaxis and therapy. 

The technical problem underlying the present invention is to provide new HCV 
sequences from hitherto unknown HCV types and/or subtypes. More particularly, 
the present invention provides new type-specific sequences of the Core, the E1 and 
the NS5 regions of new HCV types 7, 9, 10 and 11, as well as of new variants 
(subtypes) of HCV types 1, 2, 3 and 4. These new HCV sequences are useful to 
diagnose the presence of HCV type 1, and/or type 2, and/or type 3, and/or type 4, 
and/or type 7, and/or type 9, and/or type 10, and/or type 11 genotypes or serotypes 
in a biological sample. Moreover, the availability of these new type-specific 
sequences can increase the overall sensitivity of HCV detection and should also 
prove to be useful for prophylactic and therapeutic purposes. 

Hepatitis C viruses (HCV) have been found to be the major cause of non-A, 
non-B hepatitis. The sequences of cDNA clones covering the complete genome of 
several prototype isolates have been determined (Kato et al., 1990; Choo et al., 
1991; Okamoto et al., 1991; Okamoto et al., 1992). Comparison of these isolates 
shows that the variability in nucleotide sequences can be used to distinguish at least 
2 different genotypes, type 1 (HCV-1 and HCV-J) and type 2 (HC-J6 and HC-J8), 
with an average homology of about 68%. Within each type, at least two subtypes 
exist (e.g. represented by HCV-1 and HCV-J), having an average homology of about 
79%. HCV genomes belonging to the same subtype show average homologies of 
more than 90% (Okamoto et al., 1992). However, the partial nucleotide sequence 
of the NS5 region of the HCV-T isolates showed at most 67% homology with the 
previously published sequences, indicating the existence of yet another HCV type 
(Mori et al., 1992). Parts of the 5' untranslated region (UR), core, NS3, and NS5 
regions of this type 3 have been published, further establishing the similar 
evolutionary distances between the 3 major genotypes and their subtypes (Chan et 
al., 1992). Type 4 was subsequently discovered (Stuyver et al., 1993b; Simmonds 
et al., 1993a; Bukh et al., 1993; Stuyver et al., 1994a), as well as type 5 (Stuyver 
et al., 1993b; Simmonds et al., 1993c; Bukh et al., 1993; Stuyver et al., 1994b) 
and type 6 HCV groups (Bukh et al., 1993; Simmonds et al., 1993c). An overview 
of the present state of the art regarding HCV genotypes is given in Table 3. The 
nomenclature system proposed by the inventors of the present application has now 
been accepted by scientists worldwide (Simmonds et al., 1994). 
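
The homology figures quoted above (about 68% between types, about 79% between subtypes of one type, more than 90% within a subtype) can be illustrated with a short sketch. It is not the classification procedure of the application: the function names are hypothetical, the sequences are assumed to be pre-aligned and of equal length, and the two cutoffs are simply midpoints between the quoted averages, chosen for illustration only.

```python
# Illustrative sketch only; cutoffs are assumed midpoints between the
# average homologies quoted in the text (90%/79% and 79%/68%).
SUBTYPE_CUTOFF = 84.5  # assumption, not a value from the source
TYPE_CUTOFF = 73.5     # assumption, not a value from the source


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions in two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def relation(seq_a: str, seq_b: str) -> str:
    """Coarse relation between two isolates based on percent identity."""
    identity = percent_identity(seq_a, seq_b)
    if identity >= SUBTYPE_CUTOFF:
        return "same subtype"
    if identity >= TYPE_CUTOFF:
        return "same type, different subtype"
    return "different type"
```

In practice the comparison would be made over a defined genome region, such as the NS5 fragment spanning positions 7932 to 8271 referred to in the claims, after a proper alignment step that this sketch omits.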

The aim of the present invention is to provide new HCV nucleotide and amino 
acid sequences enabling the detection of HCV infection. 

Another aim of the present invention is to provide new nucleotide and amino 
acid HCV sequences enabling the classification of infected biological fluids into 
different serological groups. 

Another aim of the present invention is to provide new nucleotide and amino 
acid HCV sequences ameliorating the overall HCV detection rate. 

Another aim of the present invention is to provide new HCV sequences, useful 
for the design of HCV prophylactic or therapeutic vaccine compositions. 

Another aim of the present invention is to provide a pharmaceutical 
composition consisting of antibodies raised against the polypeptides encoded by 
these new HCV sequences, for therapy or diagnosis. 

All the aims of the present invention are met by the following embodiments 
of the present invention. 

The present invention relates more particularly to an HCV polynucleic acid, 
having a nucleotide sequence which is unique to a heretofore unidentified HCV type 
or subtype which is different from HCV subtypes 1a, 1b, 1c, 2a, 2b, 2c, 2d, 3a, 3b, 
3c, 3d, 3e, 3f, 4a, 4b, 4c, 4d, 4e, 4f, 4g, 4h, 4i, 4j, 5a or 6a, with said HCV 
subtypes being classified as in Table 3 by comparison of a part of the NS5 gene 
nucleotide sequence spanning positions 7932 to 8271, with said amino acid 
numbering being shown in Table 1, and with said polynucleic acid containing at least 
one nucleotide differing from said known HCV nucleotide sequences, or the 
complement thereof. The sequence of known HCV isolates may be found in any 
nucleotide sequence database known in the art (such as for instance the EMBL 
database). 

The present invention thus also relates to a polynucleic acid having a 
nucleotide sequence which is unique to at least one of HCV subtypes 1d, 1e, 1f, 1g, 
2e, 2f, 2g, 2h, 2i, 2k, 2l, 3g, 4k, 4l, 4m, 7a, 7c or 7d, with said HCV subtypes 
being classified as defined above. 

The present invention thus also relates to a polynucleic acid having a 
nucleotide sequence which is unique to at least one of HCV types 9, 10 or 11, with 
said HCV types being classified as defined above. 

It is to be noted that the nucleotide(s) difference in the polynucleic acids of 
the invention may involve an amino acid difference in the corresponding amino acid 
sequences encoded by said polynucleic acids. A composition according to the 
present invention may contain only polynucleic acid sequences or polynucleic acid 
sequences mixed with any excipient known in the art of diagnosis, prophylaxis or 
therapy. 

According to a preferred embodiment, the present invention relates to a 
polynucleic acid encoding an HCV polyprotein comprising in its amino acid sequence 
at least one of the following amino acid residues: 

I15, C38, V44, A49, Q43, P49, Q55, A58, S60 or D60, E68 or V68, H70, A71 or 
Q71 or N71, D72, H81, H101, D106, S110, L130, I134, E135, L140, S148, T150 
or E150, Q153, F155, D157, G160, E165, I169, F181, L186, T190, T192 or I192 
or H192, I193, A195, S196, R197 or N197 or K197, Q199 or D199 or H199 or 
N199, F200 or T200, A208, I213, M216 or S216, N217 or S217 or G217 or K217, 
T218, I219, A222, Y223, I230, W231 or L231, S232 or H232 or A232, Q233, 
E235 or L235, F236 or T236, F237, L240 or M240, A242, N244, N249, I250 or 
K250 or R250, A252 or C252, A254, I255 or V255, D256 or M256, E257, E260 
or K260, R261, V268, S272 or R272, I285, G290 or F290, A291, A293 or L293 
or W293, T294 or A294, S295 or H295, K296 or E296, Y297 or M297, I299 or 
Y299, I300, S301, P316, S2646, A2648, G2649, A2650, V2652, Q2653, H2656 
or L2656, D2657, F2659, K2663 or Q2663, A2667 or V2667, D2677, L2681, 
M2686 or Q2686 or E2686, A2692 or K2692, H2697, I2707, L2708 or Y2708, 
A2709, A2719 or M2719, F2727, T2728 or D2728, E2729, F2730 or Y2730, 
I2741, I2745, V2746 or E2746 or L2746 or K2746, A2748, S2749 or P2749, 
R2750, E2751, D2752 or N2752 or S2752 or T2752 or V2752 or I2752 or Q2752, 
S2753 or D2753 or G2753, D2754, A2755, L2756 or Q2756, R2757, 

with said notation being composed of a letter representing the amino acid residue by 
its one-letter code, and a number representing the amino acid numbering according 
to Kato et al. (1990), as shown in Table 1, 
or a part of said polynucleic acid which is unique to at least one of the HCV subtypes 
or types as defined in Table 5, and which contains at least one nucleotide differing 
from known HCV nucleotide sequences, or the complement thereof. 

Each of the above-mentioned residues can be found in Figures 2, 4 or 6, 
showing the new amino acid sequences of the present invention aligned with known 
sequences of other types or subtypes of HCV for the Core/E1 region. 
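
The residue notation described above (a one-letter amino acid code followed by a position number, e.g. S60 or I2707) lends itself to a simple mechanical check. The following is a minimal sketch under stated assumptions: the function names are hypothetical, the polyprotein string is assumed to be numbered from position 1 in the same way as Table 1, and the short fragment in the usage note is a toy example rather than a sequence taken from the application.

```python
import re

# One capital letter (one-letter amino acid code) followed by a position number.
_NOTATION = re.compile(r"^([A-Z])(\d+)$")


def parse_residue(notation: str) -> tuple[str, int]:
    """Split a notation such as 'S2646' into ('S', 2646)."""
    match = _NOTATION.match(notation.strip())
    if match is None:
        raise ValueError(f"not a residue notation: {notation!r}")
    return match.group(1), int(match.group(2))


def has_residue(polyprotein: str, notation: str, first_position: int = 1) -> bool:
    """True if the polyprotein carries the given residue at the given position.

    first_position is the number assigned to the first character of the
    supplied string (1 for a full-length polyprotein; an assumption here).
    """
    residue, position = parse_residue(notation)
    index = position - first_position
    if index < 0 or index >= len(polyprotein):
        return False
    return polyprotein[index] == residue
```

For a toy 15-residue fragment such as `"MSTNPKPQRKTKRNI"`, `has_residue(fragment, "I15")` checks the last character, while a notation pointing beyond the fragment, such as `"C38"`, simply returns `False`.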

According to another preferred embodiment, the present invention relates to 
a polynucleic acid encoding a HCV polyprotein comprising in its amino acid sequence 
at least one amino acid sequence chosen from the following list: 

ARQSDGRSWAQ or ARRSEGRSWAQ as for subtype 1d (SEQ ID NO 107 and 108) 
ERRPEGRSWAQ as for subtype 1e (SEQ ID NO 109) 
ARRPEGRSWAQ as for subtype 1f (SEQ ID NO 110) 
DRRTTGKSWGR as for subtype 2k (SEQ ID NO 111) 
DRRATGRSWGR as for subtype 2e (SEQ ID NO 112) 
DRRATGKSWGR as for subtype 2f (SEQ ID NO 113) 
VRQPTGRSWGQ as for type 9 (SEQ ID NO 114) 
VRHQTGRTWAQ as for subtype 7a and 7c (SEQ ID NO 115) 
VRQNQGRTWAQ as for subtype 7d (SEQ ID NO 116) 
ARRTEGRSWAQ as for type 10 (SEQ ID NO 117) 
VRRTTGRXXXX or VRRTTGRTWAQ as for type 11 (SEQ ID NO 118 and 119) 
HEVRNASGVYHV or HEVRNASGVYHL as for subtype 1d (SEQ ID NO 120 and 121) 
YEVHSTTDGYHV as for subtype 1f (SEQ ID NO 122) 
VEVKNTSQAYMA as for subtype 2e (SEQ ID NO 123) 
IQVKNNSHFYMA as for subtype 2f (SEQ ID NO 124) 
VQVKNTSTMYMA as for subtype 2g (SEQ ID NO 125) 
VQVKNTSHSYMV as for subtype 2h (SEQ ID NO 126) 
VQVANRSGSYMV as for subtype 2i (SEQ ID NO 127) 
VEIKNTXNTYVL or VEIKNTSNTYVL as for subtype 2k (SEQ ID NO 128 and 129) 
INYRNVSGIYYV or INYRNTSGIYHV or INYHNTSGIYHI or TNYRNVSGIYHV as for subtype 4k (SEQ ID NO 130, 131, 132 or 133) 
QHYRNVSGIYHV as for subtype 4l (SEQ ID NO 134) 
IQVKNASGIYHL as for type 9 (SEQ ID NO 135) 
AHYTNKSGLYHL as for subtype 7c (SEQ ID NO 136) 
LNYANKSGLYHL as for subtype 7d (SEQ ID NO 137) 
LEYRNASGLYMV as for type 10 (SEQ ID NO 138) 
IYEMDGMIMHY or IYEMSGMILHA as for subtype 1d (SEQ ID NO 139 and 140) 
VYEAKDIILHT as for subtype 1f (SEQ ID NO 141) 
VWQLXDAVLHV as for subtype 2e (SEQ ID NO 142) 
VWQLRDAVLHV as for subtype 2f (SEQ ID NO 143) 
IWQMQGAVLHV as for subtype 2g (SEQ ID NO 144) 
VWQLKDAVLHV as for subtype 2h (SEQ ID NO 145) 
VWQLEEAVLHV as for subtype 2i (SEQ ID NO 146) 
TWQLXXAVLHV as for subtype 2k (SEQ ID NO 147) 
VYEADHHILHL or VYEADHHILAL or VFEADHHILHL as for subtype 4k (SEQ ID NO 148, 149 and 150) 
VYESDHHILHL as for subtype 4l (SEQ ID NO 151) 
VFEAETMILHL as for type 9 (SEQ ID NO 152) 
VYEAETLILHL as for subtype 7c (SEQ ID NO 153) 

SUBSTITUTE SHEET (RULE 26) 

WO 96/13590 PCT/EP95/04155 

VYEANGMILHL as for subtype 7d (SEQ ID NO 154) 
VYEAGDIILHL as for type 10 (SEQ ID NO 155) 
VREDNHLRCWMAL or VRENNSSRCWMAL as for subtype 1d (SEQ ID NO 156 and 157) 
IREGNISRCWVPL as for subtype 1f (SEQ ID NO 158) 
ENSSGRFHCWIPI as for subtype 2e (SEQ ID NO 159) 
ERSGNRTFCWTAV as for subtype 2f (SEQ ID NO 160) 
ELQGNKSRCWIPV as for subtype 2g (SEQ ID NO 162) 
ERHQNQSRCWIPV as for subtype 2h (SEQ ID NO 163) 
EWKDNTSRCWIPV as for subtype 2i (SEQ ID NO 164) 
EREGNSSRCWIPV as for subtype 2k (SEQ ID NO 165) 
VREGNQSRCWVAL or VRTGNQSRCWVAL or VRVGNQSSCWVAL or VRVGNQSRCWVAL or VKEGNHSRCWVAL as for subtype 4k (SEQ ID NO 166, 167, 168 or 169) 
VKTGNTSRCWVAL as for subtype 4l (SEQ ID NO 170) 
IKAGNESRCWLPV as for type 9 (SEQ ID NO 171) 
VKEGNQSRCWVQA as for subtype 7c (SEQ ID NO 172) 
VKXXNLTKCWLSA as for subtype 7d (SEQ ID NO 173) 
VRSGNTSRCWIPV as for type 10 (SEQ ID NO 174) 
VKNASVPTAA or VKDANVPTAA as for subtype 1d (SEQ ID NO 175 and 176) 
ARIANAPIDE as for subtype 1f (SEQ ID NO 177) 
VSKPGALTKG as for subtype 2e (SEQ ID NO 178) 
VSRPGALTRG as for subtype 2f (SEQ ID NO 179) 
VNQPGALTRG as for subtype 2g (SEQ ID NO 180) 
VSQPGALTRG as for subtype 2h (SEQ ID NO 181) 
VSQPGALTKG as for subtype 2i (SEQ ID NO 182) 
VSRPGALTEG as for subtype 2k (SEQ ID NO 183) 
APYIGAPLES or APYTAAPLES as for subtype 4k (SEQ ID NO 184 and 185) 
APILSAPLMS as for subtype 4l (SEQ ID NO 186) 
VPNSSVPIHG as for type 9 (SEQ ID NO 187) 
VPNASTPVTG as for subtype 7c (SEQ ID NO 188) 
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VQNASVSIRG as for subtype 7d (SEQ ID NO 189) 

VKSPCAATAS as for type 10 (SEQ ID NO 190) 

SPRMHHTTQE or SPRLYHTTQE as for subtype 1d (SEQ ID NO 191 and 192)

TSRRHWTVQD as for subtype 1f (SEQ ID NO 193)

APKRHYFVQE as for subtype 2e (SEQ ID NO 194)

SPQYHTFVQE as for subtype 2f (SEQ ID NO 195)

SPQHHNFSQD as for subtype 2g (SEQ ID NO 196)

SPQHHIFVQD as for subtype 2h (SEQ ID NO 197)

SPEHHHFVQD as for subtype 2k (SEQ ID NO 198)

RPRRHWTTQD or RPRRHWTAQD or QPRRHWTTQD or RPRRHWTTQE as for subtype 4k (SEQ ID NO 199, 200, 201 or 202)

QPRRHWTVQD as for subtype 4l (SEQ ID NO 203)

RPKYHQVTQD as for type 9 (SEQ ID NO 204)

RPRMHQVVQE as for subtype 7c (SEQ ID NO 205)

RPRMYEIAQD as for subtype 7d (SEQ ID NO 206)

RHRQHWTVQD as for type 10 (SEQ ID NO 207)

or a part of said polynucleic acid which is unique to at least one of the HCV subtypes
or types as defined in Table 5, and which contains at least one nucleotide differing
from known HCV nucleotide sequences, or the complement thereof.
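
By way of illustration only, the type- and subtype-specific amino acid motifs listed above may be treated as a lookup table and searched for in a translated sequence. The following minimal Python sketch uses three motifs copied from the list (SEQ ID NO 193, 203 and 204); the function name `find_motifs` and the test sequence are hypothetical, and the table is deliberately partial.

```python
# Hypothetical helper: scan a protein sequence for genotype-specific motifs.
# The three motifs are copied from the list above; the table is partial.
MOTIFS = {
    "TSRRHWTVQD": "subtype 1f",  # SEQ ID NO 193
    "QPRRHWTVQD": "subtype 4l",  # SEQ ID NO 203
    "RPKYHQVTQD": "type 9",      # SEQ ID NO 204
}


def find_motifs(protein):
    """Return sorted (position, motif, genotype) tuples, positions 0-based."""
    hits = []
    for motif, genotype in MOTIFS.items():
        start = protein.find(motif)
        while start != -1:
            hits.append((start, motif, genotype))
            start = protein.find(motif, start + 1)
    return sorted(hits)
```

A single motif hit is only suggestive; as stated elsewhere in the description, sequence information from more than one region is preferable before an isolate is classified.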
Using the 5' non-coding LiPA system (Stuyver et al., 1993) and a new core
LiPA system including multiple probes for subtypes 1a, 1b, 1c, 2a, 2b or 2c derived
from the core region (Stuyver et al., 1995), samples from the Benelux, Cameroon,
France and Vietnam were selected because of their aberrant reactivities (isolates
CAM1078, FR2, FR1, VN4, VN12, VN13, NE98). Some samples were, together with
many other samples, sequenced as a control for typing. Sequencing results,
however, indicated the discovery of new subtypes (isolates BNL1, BNL2, BNL3, FR4,
BNL4, BNL5, BNL6, BNL7, BNL8, BNL9, BNL10, BNL11 and BNL12). Nucleotide
sequences in the core and E1 regions which have not been reported before were
analyzed in the frame of the invention. Genomic sequences of subtype 1d, 1e, 1f,
1g, 2e, 2f, 2g, 2h, 2i, 2k, 2l, 3g, 4k, 4l, 4m, 7a, 7c, 7d and types 9, 10 and 11
isolates are reported for the first time in the present invention. The NS5B region was
also analyzed.

The term "polynucleic acid" refers to a single-stranded or double-stranded
nucleic acid sequence which may contain at least 5 contiguous nucleotides in
common with the complete nucleotide sequence (e.g. at least 6, 7, 8, 9, 10, 11, 12,
13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 60, 75 or
more contiguous nucleotides). A polynucleic acid which is up to about 100
nucleotides in length is often also referred to as an oligonucleotide. A polynucleic
acid may consist of deoxyribonucleotides or ribonucleotides, nucleotide analogues
or modified nucleotides, or may have been adapted for therapeutic purposes. A
polynucleic acid may also comprise a double stranded cDNA clone which can be used
for cloning purposes, or for in vivo therapy, or prophylaxis.
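
The requirement of a minimum number of contiguous nucleotides in common can be checked mechanically. The sketch below is an illustration under assumptions: the function name `shares_contiguous` and the example sequences are hypothetical, and exact string matching is used without regard to strand or ambiguity codes.

```python
def shares_contiguous(query, reference, k=5):
    """Return True if query and reference share at least k contiguous nucleotides."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Collect every k-long window of the reference, then test the query windows.
    kmers = {reference[i:i + k] for i in range(len(reference) - k + 1)}
    return any(query[i:i + k] in kmers for i in range(len(query) - k + 1))
```

Any stretch longer than k necessarily contains a shared stretch of exactly k, so testing windows of length k is sufficient.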

The oligonucleotides according to the present invention, used as primers or
probes, may also contain or consist of nucleotide analogues such as
phosphorothioates (Matsukura et al., 1987), alkylphosphorothioates (Miller et al., 1979)
or peptide nucleic acids (Nielsen et al., 1991; Nielsen et al., 1993) or may contain
intercalating agents (Asseline et al., 1984).

As with most other variations or modifications introduced into the original DNA
sequences of the invention, these variations will necessitate adaptations with respect
to the conditions under which the oligonucleotide should be used to obtain the
required specificity and sensitivity. However, the eventual results will be essentially
the same as those obtained with the unmodified oligonucleotides.

The introduction of these modifications may be advantageous in order to
positively influence characteristics such as hybridization kinetics, reversibility of the
hybrid-formation, biological stability of the oligonucleotide molecules, etc.

The polynucleic acids of the invention may be comprised in a composition of 
any kind. Said composition may be for diagnostic, therapeutic or prophylactic use. 

The expression "sequences which are unique to an HCV type or subtype"
refers to sequences which are not shared by any other type or subtype of HCV, and
can thus be used to uniquely detect that HCV type or subtype. Sequence variability
is demonstrated in the present invention between the newly found HCV types and
subtypes (see Table 5) and the known HCV types and subtypes (see Table 3), and
it is therefore from these regions of sequence variability in particular that type- or
subtype-specific polynucleic acids, oligonucleotides, polypeptides and peptides may
be obtained. The term type- or subtype-specific refers to the fact that a sequence
is unique to the HCV type or subtype involved.

The expression "nucleotides corresponding to" refers to nucleotides which are
homologous or complementary to an indicated nucleotide sequence or region within
a specific HCV sequence.

The term "coding region" corresponds to the region of the HCV genome that 
encodes the HCV polyprotein. In fact, it comprises the complete genome with the 
exception of the 5' untranslated region and 3' untranslated region. 

The term "HCV polyprotein" refers to the HCV polyprotein of the HCV-J
isolate (Kato et al., 1990). The adenine residue at position 330 (Kato et al., 1990)
is the first residue of the ATG codon that initiates the long HCV polyprotein of 3010
amino acids in HCV-J and other type 1b isolates, of 3011 amino acids in HCV-1
and other type 1a isolates, and of 3033 amino acids in type 2 isolates HC-J6 and
HC-J8 (Okamoto et al., 1992).

This adenine is designated as position 1 at the nucleic acid level, and this
methionine is designated as position 1 at the amino acid level, in the present
invention. As type 1a isolates contain 1 extra amino acid in the NS5A region, coding
sequences of type 1a and 1b have identical numbering in the Core, E1, NS3, and
NS4 regions, but will differ in the NS5B region as indicated in Table 1. Type 2 isolates
have 4 extra amino acids in the E2 region, and 17 or 18 extra amino acids in the
NS5 region compared to type 1 isolates, and will differ in numbering from type 1
isolates in the NS3/4 and NS5B regions as indicated in Table 1. Similar
insertions compared with type 1 (but of a different size) can also be observed in type
3a sequences, which affect the numbering of type 3a amino acids accordingly. Other
insertions or deletions may be readily observed in type 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
and 11 sequences after alignment with known HCV sequences.

TABLE 1

Region                  Positions       Positions        Positions        Positions
                        described in    described for    described for    described for
                        the present     HCV-J            HCV-1            HC-J6, HC-J8
                        invention*      (Kato et al.,    (Choo et al.,    (Okamoto et al.,
                                        1990)            1991)            1992)

Nucleotides
  NS5B                  8023/8235       8352/8564        8026/8238        8433/8645
                        7932/8271       8261/8600        7935/8274        8342/8681

  coding region of                      330/9359         1/9033           342/9439
  present invention

Amino Acids
  NS5B                  2675/2745       2675/2745        2676/2746        2698/2768
                        2645/2757       2645/2757        2646/2758        2668/2780


Table 1: Comparison of the HCV nucleotide and amino acid numbering system
used in the present invention (*) with the numbering used for other
prototype isolates. For example, 8352/8564 indicates the region
designated by the numbering from nucleotide 8352 to nucleotide 8564
as described by Kato et al. (1990). Since the numbering system of the
present invention starts at the polyprotein initiation site, the 329
nucleotides of the 5' untranslated region described by Kato et al.
(1990) have to be subtracted, and the corresponding region is
numbered from nucleotide 8023 ('8352-329') to 8235 ('8564-329').
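
The renumbering described in the caption of Table 1 amounts to a fixed offset. The following Python sketch illustrates it for HCV-J only; the function names are hypothetical, and the offset of 329 nucleotides is taken from the caption. Other prototype isolates would need their own offsets, which are not derived here.

```python
# Offset taken from the Table 1 caption: HCV-J positions (Kato et al., 1990)
# include 329 nucleotides of 5' untranslated region before the initiating ATG.
HCVJ_5UTR_LENGTH = 329


def hcvj_to_invention(position):
    """Convert an HCV-J nucleotide position to the ATG-based numbering."""
    converted = position - HCVJ_5UTR_LENGTH
    if converted < 1:
        raise ValueError("position lies in the 5' untranslated region")
    return converted


def convert_region(start, end):
    """Convert an HCV-J start/end pair to the ATG-based numbering."""
    return hcvj_to_invention(start), hcvj_to_invention(end)
```

For instance, `convert_region(8352, 8564)` reproduces the worked example of the caption, and position 330 of HCV-J maps to position 1.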



The term "genotype" as used in the present invention refers to both types 
and/or subtypes. 

The term "HCV type" corresponds to a group of HCV isolates of which the
complete genome shows more than 73%, preferably more than 74%, homology at the
nucleic acid level, or of which the NS5 region between nucleotide positions 7932
and 8271 shows more than 75.4% homology at the nucleic acid level, or of which
the complete HCV polyprotein shows more than 78% homology at the amino acid
level, or of which the NS5 region between amino acids at positions 2645 and 2757
shows more than 80% homology at the amino acid level, to polyproteins of the other
isolates of the group, with said numbering beginning at the first ATG codon or first
methionine of the long HCV polyprotein of the HCV-J isolate (Kato et al., 1990).
Isolates belonging to different types of HCV exhibit homologies, over the complete
genome, of less than 74%, preferably less than 73%, at the nucleic acid level and
less than 78% at the amino acid level. Isolates belonging to the same type usually
show homologies of about 90 to 99% at the nucleic acid level and 95 to 96% at the
amino acid level when belonging to the same subtype, and those belonging to the
same type but different subtypes preferably show homologies of about 76% to 82%
(more particularly of about 77% to 80%) at the nucleic acid level and 85-86% at the
amino acid level.
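
As a rough illustration of how such a percentage may be computed, the sketch below treats "homology" as simple percent identity between two sequences that have already been aligned to equal length. This is an assumption made for the example: the description does not specify the alignment method, and gap handling is ignored. The function names are hypothetical; the 75.4% threshold is the NS5 (positions 7932 to 8271) value given above.

```python
NS5_TYPE_THRESHOLD = 75.4  # percent, nucleic acid level, NS5 positions 7932-8271


def percent_identity(seq_a, seq_b):
    """Percentage of identical positions in two pre-aligned, equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


def same_type_by_ns5(seq_a, seq_b):
    """Apply the NS5 nucleic acid threshold to two aligned NS5 fragments."""
    return percent_identity(seq_a, seq_b) > NS5_TYPE_THRESHOLD
```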

More preferably the definition of HCV types is concluded from the 
classification of HCV isolates according to their nucleotide distances calculated as 
detailed below: 

(1) based on phylogenetic analysis of nucleic acid sequences in the NS5B
region between nucleotides 7935 and 8274 (Choo et al., 1991) or 8261 and 8600
(Kato et al., 1990) or 8342 and 8681 (Okamoto et al., 1991), isolates belonging to
the same HCV type show nucleotide distances of less than 0.34, usually less than
0.33, and more usually of less than 0.32, and isolates belonging to the same
subtype show nucleotide distances of less than 0.135, usually of less than 0.13, and
more usually of less than 0.125, usually ranging between 0.0003 and 0.1151, and
consequently isolates belonging to the same type but different subtypes show
nucleotide distances ranging from 0.135 to 0.34, usually ranging from 0.1384 to
0.2977, and more usually ranging from 0.15 to 0.32, and isolates belonging to
different HCV types show nucleotide distances greater than 0.34, usually greater
than 0.35, and more usually of greater than 0.358, more usually ranging from
0.3581 to 0.6670.

(2) based on phylogenetic analysis of nucleic acid sequences in the core/E1 
region between nucleotides 378 and 957, isolates belonging to the same HCV type 
show nucleotide distances of less than 0.38, usually of less than 0.37, and more 
usually of less than 0.364, and isolates belonging to the same subtype show 
nucleotide distances of less than 0.17, usually of less than 0.16, and more usually 
of less than 0.15, more usually less than 0.135, more usually less than 0.134, and 
consequently isolates belonging to the same type but different subtypes show 
nucleotide distances ranging from 0.15 to 0.38, usually ranging from 0.16 to 0.37, 
and more usually ranging from 0.17 to 0.36, more usually ranging from 0.133 to 
0.379, and isolates belonging to different HCV types show nucleotide distances 
greater than 0.34, 0.35, 0.36, usually more than 0.365, and more usually of greater 
than 0.37,

Region       Core/E1              E1                   NS5B                 NS5B
             579 bp               384 bp               340 bp               222 bp

Isolates*    0.0017 - 0.1347      0.0026 - 0.2031      0.0003 - 0.1151      0.000 - 0.1323
             (0.0750 ± 0.0245)    (0.0969 ± 0.0289)    (0.0637 ± 0.0229)    (0.0607 ± 0.0205)

Subtypes*    0.1330 - 0.3794      0.1645 - 0.4869      0.1384 - 0.2977      0.117 - 0.3538
             (0.2786 ± 0.0363)    (0.3761 ± 0.0433)    (0.2219 ± 0.0341)    (0.2391 ± 0.0399)

Types*       0.3479 - 0.6306      0.4309 - 0.9561      0.3581 - 0.6670      0.3457 - 0.7471
             (0.4703 ± 0.0525)    (0.6305 ± 0.0928)    (0.4994 ± 0.0495)    (0.5295 ± 0.0627)

Table 2: * Figures created by the PHYLIP program DNADIST are expressed
as minimum to maximum (average ± standard deviation).
Phylogenetic distances for isolates belonging to the same
subtype ('isolates'), to different subtypes of the same type
('subtypes'), and to different types ('types') are given.

In a comparative phylogenetic analysis of available sequences, ranges of
molecular evolutionary distances for different regions of the genome were calculated,
based on 19,781 pairwise comparisons by means of the DNADIST program of the
phylogeny inference package PHYLIP version 3.5c (Felsenstein, 1993). The results
are shown in Table 2 and indicate that although the majority of distances obtained
in each region fit with classification of a certain isolate, only the ranges obtained in
the 340 bp NS5B region are non-overlapping and therefore conclusive. However, as
was performed in the present invention, it is preferable to obtain sequence
information from at least 2 regions before final classification of a given isolate.
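
The statement that only the 340 bp NS5B ranges are non-overlapping can be verified directly from the minimum and maximum values of Table 2. The Python sketch below is illustrative only: the dictionary layout and function names are hypothetical, and a distance falling in a gap between two observed ranges is simply reported as unclassified rather than assigned to the nearest level.

```python
# Minimum and maximum DNADIST distances copied from Table 2.
# 'isolates' = same subtype, 'subtypes' = same type but different subtype,
# 'types' = different types.
RANGES = {
    "Core/E1 579 bp": {"isolates": (0.0017, 0.1347), "subtypes": (0.1330, 0.3794), "types": (0.3479, 0.6306)},
    "E1 384 bp": {"isolates": (0.0026, 0.2031), "subtypes": (0.1645, 0.4869), "types": (0.4309, 0.9561)},
    "NS5B 340 bp": {"isolates": (0.0003, 0.1151), "subtypes": (0.1384, 0.2977), "types": (0.3581, 0.6670)},
    "NS5B 222 bp": {"isolates": (0.000, 0.1323), "subtypes": (0.117, 0.3538), "types": (0.3457, 0.7471)},
}


def is_non_overlapping(region):
    """True if the three distance ranges of a region do not overlap."""
    r = RANGES[region]
    return r["isolates"][1] < r["subtypes"][0] and r["subtypes"][1] < r["types"][0]


def classify_ns5b_340(distance):
    """Assign a pairwise NS5B 340 bp distance to the Table 2 range containing it."""
    for level, (low, high) in RANGES["NS5B 340 bp"].items():
        if low <= distance <= high:
            return level
    return "unclassified"
```

Applying `is_non_overlapping` to each region singles out the NS5B 340 bp column, in agreement with the text.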

Designation of a number to the different types of HCV and HCV nomenclature
is based on chronological discovery of the different types. The numbering system
used in the present invention might still fluctuate according to international
conventions or guidelines. For example, "type 4" might be changed into "type 5" or
"type 6". Also the arbitrarily chosen border distances between types and subtypes
and isolates may still be subject to change according to international guidelines or
conventions. Therefore types 7a, 8a, 8b, 9a may for example be designated 6b, 6c,
6d, and 6d in the future; and type 10a which shows relatedness with genotype 3
may be denoted 3g instead of 10a.

The term "subtype" corresponds to a group of HCV isolates of which the 
complete polyprotein shows a homology of more than 90% both at the nucleic acid 
and amino acid levels, or of which the NS5 region between nucleotide positions 
7932 and 8271 shows a homology of more than 90% at the nucleic acid level to the 
corresponding parts of the genomes of the other isolates of the same group, with 
said numbering beginning with the adenine residue of the initiation codon of the HCV 
polyprotein. Isolates belonging to the same type but different subtypes of HCV show 
homologies of more than 74% at the nucleic acid level and of more than 78% at the 
amino acid level. 

It is to be understood that extremely variable regions such as the E1, E2 and
NS4 regions will exhibit lower homologies than the average homology of the
complete genome of the polyprotein.

Using these criteria, HCV isolates can be classified into at least 11 types.
Several subtypes can clearly be distinguished in types 1, 2, 3, 4 and 7: 1a, 1b, 1c,
1d, 1e, 1f, 1g, 2a, 2b, 2c, 2d, 2e, 2f, 2g, 2h, 2i, 2k, 2l, 3a, 3b, 3c, 3d, 3f, 3g, 4a,
4b, 4c, 4d, 4e, 4f, 4g, 4h, 4i, 4j, 4k, 4l, 4m, 7a, 7c, and 7d based on homologies
of the 5' UR and coding regions. An overview of most of the reported isolates and
their proposed classification according to the typing system of the present invention
as well as other proposed classifications is presented in Table 3.



Table 3

HCV              OKAMOTO  MORI  NAKAO  CHA   PROTOTYPE
CLASSIFICATION

1a               I        I     Pt     GI    HCV-1, HCV-H, HC-J1
1b               II       II    KI     GII   HCV-J, HCV-BK, HCV-T, HC-JK1, HC-J4,
                                             HCV-CHINA
1c                                           HC-G9
2a               III      III   K2a    GIII  HC-J6
2b               IV       IV    K2b    GIII  HC-J8
2c                                           S83, ARG6, ARG8, 110, T983
2d                                           NE92
3a               V        V     K3     GIV   BR36, BR56, HD10, NZL1, BR33, Ta, E-b1
3b                        VI    K3     GIV   HCV-TR, Tb, NE137
3c                                           NE48
3d                                           NE274
3e                                           NE145
3f                                           NE125
4a                                           Z4, GB809-4
4b                                           Z1
4c                                           GB116, GB358, GB215, Z6, Z7
4d                                           DK13
4e                                           GB809-2, CAM600, CAM736
4f                                           CAM622, CAM627
4g                                           GB549
4h                                           GB438
4i                                           CAR4/1205
4j                                           CAR1/905
5a                                     GV    SA3, SA4, SA1, SA7, SA11, BE95
6a                                           HK1, HK2, HK3, HK4, VN11

Table 3: Overview of the known HCV types and subtypes classified according
to the different authors.



The term "complement" refers to a nucleotide sequence which is 
complementary to an indicated sequence and which is able to hybridize to the 

indicated sequences. 
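
For illustration, the complement of a sequence in the sense used here (the strand able to hybridize to the indicated sequence) can be computed as a reverse complement. The helper below is a minimal sketch with a hypothetical name; it returns DNA letters and maps U to A so that RNA input is accepted, and it does not handle ambiguity codes.

```python
# Translation table for Watson-Crick pairing; U (RNA) is paired with A.
COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA or RNA sequence, written as DNA."""
    return sequence.translate(COMPLEMENT)[::-1]
```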

The composition of the invention can comprise many combinations. By way 
of example, the composition of the invention can comprise: 

two (or more) nucleic acids from the same region or, 

two nucleic acids (or more), respectively from different regions, for the same 
isolate or for different isolates, 

or nucleic acids from the same regions and from at least two different regions
(for the same isolate or for different isolates).

The present invention relates particularly to a polynucleic acid as defined
above having a sequence selected from any of SEQ ID NO 1, 3, 5, 7, 9, 11, 13, 15,
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55,
57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95,
97, 99, 101, 103 to 105, or a part of said polynucleic acid which is unique to any
of the HCV subtypes or types as defined in Table 5, and which contains at least one
nucleotide differing from known HCV polynucleic acids, or the complement thereof.
The present invention relates more particularly to a polynucleic acid as defined
above, which codes for the 5' UR, the Core/E1, the NS4 or the NS5B region or a part
thereof.

More particularly, the present invention relates to a polynucleic acid as defined 
above which is a cDNA sequence. 

Also included within the present invention are sequence variants of the
polynucleic acids as selected from any of the nucleotide sequences as given in any
of the above given SEQ ID numbers, with said sequence variants containing either
deletions and/or insertions of one or more nucleotides, especially insertions or
deletions of 1 or more codons, mainly at the extremities of oligonucleotides (either
3' or 5'), or substitutions of some non-essential nucleotides (i.e. nucleotides not
essential to discriminate between different genotypes of HCV) by others (including
modified nucleotides and/or inosine). For example, a type 1 or 2 sequence might be
modified into a type 7 sequence by replacing some nucleotides of the type 1 or 2
sequence with type-specific nucleotides of type 7 as shown in, for instance, Figures
1 and 2.

Particularly preferred variant polynucleic acids of the present invention also
include sequences which hybridise under stringent conditions with any of the
polynucleic acid sequences of the present invention, particularly sequences which
show a high degree of homology (similarity) to any of the polynucleic acids of the
invention as described above, and particularly sequences which are at least 80%, 85%,
90%, 95% or more homologous to said polynucleic acid sequences of the invention.
Preferably said sequences will have less than 20%, 15%, 10%, or 5% variation of
the original nucleotides of said polynucleic acid sequence.

Polynucleic acid sequences according to the present invention which are
homologous to the sequences as represented by a SEQ ID NO can be characterized
and isolated according to any of the techniques known in the art, such as
amplification by means of sequence-specific primers, hybridization with
sequence-specific probes under more or less stringent conditions, serological screening
methods or via the LiPA typing system.

Other preferred variant polynucleic acids of the present invention include
sequences which are redundant as a result of the degeneracy of the genetic code
compared to any of the above-given polynucleic acids of the present invention. These
variant polynucleic acid sequences will thus encode the same amino acid sequence
as the polynucleic acids they are derived from.
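
The degeneracy referred to here can be illustrated with a small example. In the sketch below, the codon table is a deliberately partial excerpt of the standard genetic code, and the two nucleotide sequences are hypothetical; both translate to VYEA, the first four residues of several of the motifs listed earlier.

```python
# Partial standard genetic code, sufficient for this illustration only.
CODON_TABLE = {
    "GTT": "V", "GTC": "V", "GTA": "V", "GTG": "V",
    "TAT": "Y", "TAC": "Y",
    "GAA": "E", "GAG": "E",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
}


def translate(dna):
    """Translate a DNA string codon by codon using the partial table above."""
    if len(dna) % 3:
        raise ValueError("length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encode_same_peptide(dna_a, dna_b):
    """True if two coding sequences are redundant variants of one another."""
    return translate(dna_a) == translate(dna_b)
```

The two hypothetical sequences GTTTATGAAGCT and GTGTACGAGGCC differ at four positions yet encode the same peptide.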

Also included within the scope of the present invention are 5' non-coding
region sequences which can be readily obtained from type 1 subtype 1d, 1e, 1f or
1g isolates; type 2 subtype 2e, 2f, 2g, 2h, 2i, 2k or 2l isolates; type 3 subtype 3g
isolates; type 4 subtype 4k, 4l or 4m isolates; type 7 subtype 7a, 7c or 7d isolates;
type 9, type 10 or type 11 isolates described herein. Such sequences may contain
type- or subtype-specific motifs which can be employed for type- and/or
subtype-specific hybridization assays, e.g. such as described by Stuyver et al. (1993).

Polynucleic acid sequences of the genomes indicated above from regions not
yet depicted in the present examples, figures and sequence listing can be obtained
by any of the techniques known in the art, such as amplification techniques using
suitable primers from the sequences of these new genomes given in Figure 1 of the
present invention.

The present invention also relates to an oligonucleotide primer comprising part
of a polynucleic acid as defined above, with said primer being able to act as a primer
for specifically amplifying the nucleic acid of a certain HCV isolate belonging to the
genotype from which the primer is derived.

The term "primer" refers to a single stranded DNA oligonucleotide sequence
capable of acting as a point of initiation for synthesis of a primer extension product
which is complementary to the nucleic acid strand to be copied. The length and the
sequence of the primer must be such that they allow priming of the synthesis of the
extension products. Preferably the primer is about 5-50 nucleotides long. Specific length
and sequence will depend on the complexity of the required DNA or RNA targets, as
well as on the conditions of primer use such as temperature and ionic strength.

The fact that amplification primers do not have to match exactly with the
corresponding template sequence to warrant proper amplification is amply
documented in the literature (Kwok et al., 1990).

The amplification method used can be either polymerase chain reaction (PCR;
Saiki et al., 1988), ligase chain reaction (LCR; Landegren et al., 1988; Wu & Wallace,
1989; Barany, 1991), nucleic acid sequence-based amplification (NASBA; Guatelli
et al., 1990; Compton, 1991), transcription-based amplification system (TAS; Kwoh
et al., 1989), strand displacement amplification (SDA; Duck, 1990; Walker et al.,
1992) or amplification by means of replicase (Lizardi et al., 1988; Lomeli et al.,
1989) or any other suitable method to amplify nucleic acid molecules using primer
extension. During amplification, the amplified products can be conveniently labelled
either using labelled primers or by incorporating labelled nucleotides. Labels may be
isotopic (32P, 35S, etc.) or non-isotopic (biotin, digoxigenin, etc.). The amplification
reaction is repeated between 20 and 70 times, advantageously between 25 and 45
times.
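
To give a feeling for these cycle numbers in the case of PCR, the theoretical yield after n cycles can be estimated as the starting copy number multiplied by (1 + efficiency) to the power n. The sketch below is an idealized model introduced for illustration, not a figure from the present description; the function name and the efficiency parameter are assumptions, and real reactions plateau well before the upper cycle numbers.

```python
def theoretical_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized PCR yield: each cycle multiplies the copy number by (1 + efficiency)."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return initial_copies * (1.0 + efficiency) ** cycles
```

With perfect doubling, 25 cycles starting from a single template give 2 to the power 25 copies, i.e. roughly 3.4 x 10^7.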

The present invention also relates to an oligonucleotide probe comprising part
of a polynucleic acid as defined above, with said probe being able to act as a
hybridization probe for specific detection and/or classification into types and/or
subtypes of an HCV nucleic acid containing said nucleotide sequence, with said
probe being possibly labelled or attached to a solid substrate.

The term "probe" refers to single stranded sequence-specific oligonucleotides 
which have a sequence which is complementary to the target sequence of the HCV 
genotype(s) to be detected. 

Preferably, these probes are about 5 to 50 nucleotides long, more preferably
from about 10 to 25 nucleotides.

The term "solid support" can refer to any substrate to which an
oligonucleotide probe can be coupled, provided that it retains its hybridization
characteristics and provided that the background level of hybridization remains low.
Usually the solid substrate will be a microtiter plate, a membrane (e.g. nylon or
nitrocellulose) or a microsphere (bead). Prior to application to the membrane or
fixation it may be convenient to modify the nucleic acid probe in order to facilitate
fixation or improve the hybridization efficiency. Such modifications may encompass
homopolymer tailing, coupling with different reactive groups such as aliphatic
groups, NH2 groups, SH groups, carboxylic groups, or coupling with biotin or
haptens.

The present invention also relates to a diagnostic kit for use in determining the
genotype of HCV, said kit comprising a primer as defined above.

The present invention also relates to a diagnostic kit for use in determining the
genotype of HCV, said kit comprising a probe as defined above.

The present invention also relates to a diagnostic kit as defined above, wherein 
said probe(s) is(are) attached to a solid substrate. 

The present invention also relates to a diagnostic kit as defined above, wherein
a range of said probes is attached to specific locations on a solid substrate.

The present invention also relates to a diagnostic kit as defined above, wherein 
said solid support is a membrane strip and said probes are coupled to the membrane 
in the form of parallel lines. 

The present invention also relates to a method for the detection of HCV
nucleic acids present in a biological sample, comprising:

(i) possibly extracting sample nucleic acid, 

(ii) amplifying the nucleic acid with at least one primer as defined above, 

(iii) detecting the amplified nucleic acids. 

The present invention also relates to a method for the detection of HCV
nucleic acids present in a biological sample, comprising:

(i) possibly extracting sample nucleic acid,

(ii) possibly amplifying the nucleic acid with at least one primer as defined
above, or with a universal HCV primer,

(iii) hybridizing the nucleic acids of the biological sample, possibly under
denatured conditions, at appropriate conditions with one or more probes as
defined above, with said probes being preferably attached to a solid
substrate,

(iv) possibly washing at appropriate conditions, 

(v) detecting the hybrids formed. 

The present invention also relates to a method for detecting the presence of
one or more HCV genotypes present in a biological sample, comprising:

(i) possibly extracting sample nucleic acid,

(ii) specifically amplifying the nucleic acid with at least one primer as defined
above,

(iii) detecting said amplified nucleic acids.

The present invention also relates to a method for detecting the presence of
one or more HCV genotypes present in a biological sample, comprising:

(i) possibly extracting sample nucleic acid,

(ii) possibly amplifying the nucleic acid with at least one primer as defined
above or with a universal HCV primer,

(iii) hybridizing the nucleic acids of the biological sample, possibly under
denatured conditions, at appropriate conditions with one or more probes as
defined above, with said probes being preferably attached to a solid
substrate,

(iv) possibly washing at appropriate conditions,

(v) detecting the hybrids formed,

(vi) inferring the presence of one or more HCV genotypes present from the
observed hybridization pattern.

The present invention also relates to a method as defined above, wherein said 
probes are further characterized as defined above. 

The present invention also relates to a method as defined above, wherein said
nucleic acids are labelled during or after amplification.

Preferably, this technique could be performed in the 5' non-coding, Core or
NS5B region.

The term "nucleic acid" can also be referred to as analyte strand and
corresponds to a single- or double-stranded nucleic acid molecule. This analyte strand
is preferentially positive- or negative-stranded RNA, cDNA or amplified cDNA.

The term "biological sample" refers to any biological sample (tissue or fluid)
containing HCV nucleic acid sequences and refers more particularly to blood serum
or plasma samples.

The term "universal HCV primer" refers to oligonucleotide sequences 
complementary to any of the conserved regions of the HCV genome. 
The expression "appropriate" hybridization and washing conditions is to be
understood as meaning stringent conditions, which are generally known in the art
(e.g. Maniatis et al., Molecular Cloning: A Laboratory Manual, New York, Cold
Spring Harbor Laboratory, 1982).

However, according to the hybridization solution (SSC, SSPE, etc.), these
probes should be hybridized at their appropriate temperature in order to attain
sufficient specificity.

The term "labelled" refers to the use of labelled nucleic acids. This may include
the use of labelled nucleotides incorporated during the polymerase step of the
amplification such as illustrated by Saiki et al. (1988) or Bej et al. (1990) or labelled
primers, or by any other method known to the person skilled in the art.

The process of the invention comprises the steps of contacting any of the
probes as defined above, with one of the following elements:

- either a biological sample in which the nucleic acids are made available for
hybridization,

- or the purified nucleic acids contained in the biological sample,

- or a single copy derived from the purified nucleic acids,

- or an amplified copy derived from the purified nucleic acids, with said
elements or with said probes being attached to a solid substrate.

The expression "inferring the presence of one or more HCV genotypes present
from the observed hybridization pattern" refers to the identification of the presence
of HCV genomes in the sample by analyzing the pattern of binding of a panel of
oligonucleotide probes. Single probes may provide useful information concerning the
presence or absence of HCV genomes in a sample. On the other hand, the variation
of the HCV genomes is dispersed in nature, so rarely is any one probe able to identify
uniquely a specific HCV genome. Rather, the identity of an HCV genotype may be
inferred from the pattern of binding of a panel of oligonucleotide probes, which are
specific for (different) segments of the different HCV genomes. Depending on the
choice of these oligonucleotide probes, each known HCV genotype will correspond
to a specific hybridization pattern upon use of a specific combination of probes. Each
HCV genotype will also be able to be discriminated from any other HCV genotype
amplified with the same primers depending on the choice of the oligonucleotide
probes. Comparison of the generated pattern of positively hybridizing probes for a
sample containing one or more unknown HCV sequences to a scheme of expected
hybridization patterns allows one to clearly infer the HCV genotypes present in said
sample.
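
The comparison of an observed pattern against a scheme of expected patterns can be sketched as a simple set comparison. In the Python example below, the probe names (P1 to P5), the expected patterns and the function name are all invented for illustration and do not correspond to any probe of the sequence listing; a genotype is reported when every probe of its expected pattern is among the positively hybridizing probes.

```python
# Hypothetical scheme of expected hybridization patterns (probe names invented).
EXPECTED_PATTERNS = {
    "1a": {"P1", "P2"},
    "1b": {"P1", "P3"},
    "2a": {"P4", "P5"},
}


def infer_genotypes(positive_probes):
    """Return genotypes whose full expected probe set is contained in the observed positives."""
    observed = set(positive_probes)
    return sorted(g for g, pattern in EXPECTED_PATTERNS.items() if pattern <= observed)
```

Because the test is containment rather than equality, a sample containing more than one genotype yields more than one result; a real scheme would also have to deal with probes shared between genotypes and with weak or missing signals.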

The present invention thus relates to a method as defined above, wherein one
or more hybridization probes are selected from any of SEQ ID NO 1, 3, 5, 7, 9, 11,
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51,
53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91,
93, 95, 97, 99, 101, 103 or 105 or sequence variants thereof as defined above.

In order to distinguish the amplified HCV genomes from each other, the target
polynucleic acids are hybridized to a set of sequence-specific DNA probes targeting
HCV genotypic regions (unique regions) located in the HCV polynucleic acids.

Most of these probes target the most type- or subtype-specific regions of HCV
genotypes, but some can be caused to hybridize to more than one HCV genotype.

Depending on the hybridization solution (SSC, SSPE, etc.), these probes should
be stringently hybridized at their appropriate temperature in order to attain sufficient
specificity. However, by slightly modifying the DNA probes, either by adding or
deleting one or a few nucleotides at their extremities (either 3' or 5'), or by substituting
some non-essential nucleotides (i.e. nucleotides not essential to discriminate between
types) by others (including modified nucleotides or inosine), these probes or variants
thereof can be caused to hybridize specifically at the same hybridization conditions
(i.e. the same temperature and the same hybridization solution). Changing the
amount (concentration) of probe used may also be beneficial to obtain more specific
hybridization results. It should be noted in this context that probes of the same
length, regardless of their GC content, will hybridize specifically at approximately the
same temperature in TMACl solutions (Jacobs et al., 1988).

Suitable assay methods for purposes of the present invention to detect hybrids
formed between the oligonucleotide probes and the nucleic acid sequences in a
sample may comprise any of the assay formats known in the art, such as the
conventional dot-blot format, sandwich hybridization or reverse hybridization. For
example, the detection can be accomplished using a dot-blot format, the unlabelled
amplified sample being bound to a membrane, the membrane being incubated with
at least one labelled probe under suitable hybridization and wash conditions, and the
presence of bound probe being monitored.

An alternative and preferred method is a "reverse" dot-blot format, in which
the amplified sequence contains a label. In this format, the unlabelled oligonucleotide
probes are bound to a solid support and exposed to the labelled sample under
appropriate stringent hybridization and subsequent washing conditions. It is to be
understood that any other assay method which relies on the formation of a
hybrid between the nucleic acids of the sample and the oligonucleotide probes
according to the present invention may also be used.

According to an advantageous embodiment, the process of detecting one or
more HCV genotypes contained in a biological sample comprises the steps of
contacting amplified HCV nucleic acid copies derived from the biological sample
with oligonucleotide probes which have been immobilized as parallel lines on a solid
support.

According to this advantageous method, the probes are immobilized in a Line
Probe Assay (LiPA) format. This is a reverse hybridization format (Saiki et al., 1989)
using membrane strips onto which several oligonucleotide probes (including negative
or positive control oligonucleotides) can be conveniently applied as parallel lines.

The invention thus also relates to a solid support, preferably a membrane strip,
carrying on its surface one or more probes as defined above, coupled to the support
in the form of parallel lines.

The LiPA is a very rapid and user-friendly hybridization test. Results can be
read 4 hours after the start of the amplification. After amplification, during
which usually a non-isotopic label is incorporated in the amplified product, and
alkaline denaturation, the amplified product is contacted with the probes on the
membrane and the hybridization is carried out for about 1 to 1.5 h, after which
hybridized polynucleic acid is detected. From the hybridization pattern generated,
the HCV type can be deduced visually or, preferably, using dedicated software. The
LiPA format is completely compatible with commercially available scanning devices,
thus rendering automatic interpretation of the results very reliable. All these
advantages make the LiPA format suitable for HCV detection in a routine setting. The
LiPA format should be particularly advantageous for detecting the presence of
different HCV genotypes.

The present invention also relates to a method for detecting and identifying
novel HCV genotypes, different from the known HCV genomes, comprising the steps
of:

- determining to which HCV genotype the nucleotides present in a biological
sample belong, according to the process as defined above,

- in the case of observing a sample which does not generate a hybridization
pattern compatible with those defined in Table 3, sequencing the portion
of the HCV genome sequence corresponding to the aberrantly hybridizing
probe of the new HCV genotype to be determined.

The present invention also relates to a method for preparing a polynucleic acid
according to the present invention. These methods include any method known in the
art for preparing polynucleic acids (e.g. the phosphodiester method for synthesizing
oligonucleotides as described by Agarwal et al. 1972, Angew. Chem. Int. Ed. Engl.
11:451, the phosphotriester method of Hsiung et al. 1979, Nucleic Acids Res.
6:1371, or the automated diethylphosphoramidite method of Beaucage et al. 1981,
Tetrahedron Letters 22:1859-1862). Alternatively, the polynucleic acids of the
present invention may be isolated fragments of naturally occurring or cloned DNA or
RNA. In addition, the oligonucleotides according to the present invention may be
synthesized automatically on commercial instruments sold by a variety of
manufacturers.

The present invention particularly also relates to a polypeptide having an amino
acid sequence encoded by a polynucleic acid as defined above, or a part thereof
which is unique to at least one of the HCV subtypes or types as defined in Table 5,
and which contains at least one amino acid differing from any of the known HCV
types or subtypes, or an analog thereof being substantially homologous and
biologically equivalent.

The term 'polypeptide' refers to a polymer of amino acids and does not refer
to a specific length of the product; thus, peptides, oligopeptides, and proteins are
included within the definition of polypeptide. This term also does not refer to or
exclude post-expression modifications of the polypeptide, for example,
glycosylations, acetylations, phosphorylations and the like. Included within the
definition are, for example, polypeptides containing one or more analogues of an
amino acid (including, for example, unnatural amino acids, PNA, etc.), polypeptides
with substituted linkages, as well as other modifications known in the art, both
naturally occurring and non-naturally occurring.

The term "unique" is as defined above.

By "biologically equivalent" as used throughout the specification and claims,
it is meant that the compositions are immunogenically equivalent to the proteins
(polypeptides) or peptides of the invention as defined above and below.

By "substantially homologous" as used throughout the ensuing specification
and claims to describe proteins and peptides, it is meant a degree of homology in the
amino acid sequence to the proteins or peptides of the invention. Preferably the
degree of homology is in excess of 90%, preferably in excess of 95%, with a particularly
preferred group of proteins being in excess of 99% homologous with the proteins or
peptides of the invention.
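
As an illustration of how such thresholds could be applied, the following minimal sketch computes a percentage over two pre-aligned sequences of equal length. It assumes, as a simplification not stated in this specification, that the degree of homology is measured as percent identity over aligned positions; the function names are illustrative only.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions carrying the same residue.
    Both sequences must already be aligned to the same length;
    gap characters ('-') never count as a match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    return 100.0 * matches / len(seq_a)


def homology_class(seq_a: str, seq_b: str) -> str:
    """Report the strictest threshold (99, 95 or 90 percent) that is exceeded."""
    identity = percent_identity(seq_a, seq_b)
    for threshold in (99, 95, 90):
        if identity > threshold:
            return f">{threshold}%"
    return "<=90%"
```

For short peptides a single substitution already moves the value below 95%, so thresholds of this kind are mainly meaningful for longer proteins.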

The term "analog" as used throughout the specification or claims to describe
the proteins or peptides of the present invention, includes any protein or peptide
having an amino acid residue sequence substantially identical to a sequence
specifically shown herein in which one or more residues have been conservatively
substituted with a biologically equivalent residue. Examples of conservative
substitutions include the substitution of one non-polar (hydrophobic) residue such as
isoleucine, valine, leucine or methionine for another, the substitution of one polar
(hydrophilic) residue for another such as between arginine and lysine, between
glutamine and asparagine, between glycine and serine, the substitution of one basic
residue such as lysine, arginine or histidine for another, or the substitution of one
acidic residue, such as aspartic acid or glutamic acid for another. Examples of
allowable mutations according to the present invention can be found in Table 4.
The phrase "conservative substitution" also includes the use of a chemically
derivatized residue in place of a non-derivatized residue provided that the resulting
protein or peptide is biologically equivalent to the protein or peptide of the invention.

"Chemical derivative" refers to a protein or peptide having one or more 
residues chemically derivatized by reaction of a functional side group. Examples of
such derivatized molecules include, but are not limited to, those molecules in which
free amino groups have been derivatized to form amine hydrochlorides, p-toluene
sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl groups, chloracetyl groups
or formyl groups. Free carboxyl groups may be derivatized to form salts, methyl and
ethyl esters or other types of esters or hydrazides. Free hydroxyl groups may be
derivatized to form O-acyl or O-alkyl derivatives. The imidazole nitrogen of histidine
may be derivatized to form N-imbenzylhistidine. Also included as chemical derivatives
are those proteins or peptides which contain one or more naturally-occurring amino
acid derivatives of the twenty standard amino acids. For example: 4-hydroxyproline
may be substituted for proline; 5-hydroxylysine may be substituted for lysine;
3-methylhistidine may be substituted for histidine; homoserine may be substituted for
serine; and ornithine may be substituted for lysine. The proteins or peptides of the
present invention also include any protein or peptide having one or more additions
and/or deletions of residues relative to the sequence of a peptide whose sequence
is shown herein, so long as the peptide is biologically equivalent to the proteins or
peptides of the invention.

It is to be noted that, at the level of the amino acid sequence, at least one
amino acid difference (with respect to known HCV amino acid sequences) is
sufficient to be part of the invention, which means that the polypeptides of the
invention correspond to polynucleic acids having at least one nucleotide difference
(with known HCV polynucleic acid sequences) involving an amino acid difference in
the encoded polyprotein.

As the NS4 and the Core regions are known to contain several epitopes, for
example characterized in patent application EP-A-0 489 968, and as the E1 protein
is expected to be subject to immune attack as part of the viral envelope and
expected to contain epitopes, the NS4, Core and E1 epitopes of the new types and
subtypes disclosed herein will consistently differ from the epitopes present in
previously known genotypes. This is exemplified by the type-specificity of NS4
synthetic peptides as described in Simmonds et al. (1993c) and Stuyver et al.
(1993b) and PCT/EP 94/01323 and the type-specificity of recombinant E1 proteins
as described in Maertens et al. (1994).

The peptides according to the present invention preferably contain at least 3,
preferably 4 or 5, contiguous HCV amino acids, or 6 or 7, preferably however at least 8
contiguous HCV amino acids, at least 10 or at least 15 (for instance at least 9, 10,
11, 12, 13, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50 or more
amino acids).



TABLE 4

Amino acids     Synonymous groups

Ser (S)         Ser, Thr, Gly, Asn
Arg (R)         Arg, His, Lys, Glu, Gln
Leu (L)         Leu, Ile, Met, Phe, Val, Tyr
Pro (P)         Pro, Ala, Thr, Gly
Thr (T)         Thr, Pro, Ser, Ala, Gly, His, Gln
Ala (A)         Ala, Pro, Gly, Thr
Val (V)         Val, Met, Ile, Tyr, Phe, Leu, Val
Gly (G)         Gly, Ala, Thr, Pro, Ser
Ile (I)         Ile, Met, Leu, Phe, Val, Ile, Tyr
Phe (F)         Phe, Met, Tyr, Ile, Leu, Trp, Val
Tyr (Y)         Tyr, Phe, Trp, Met, Ile, Val, Leu
Cys (C)         Cys, Ser, Thr, Met
His (H)         His, Gln, Arg, Lys, Glu, Thr
Gln (Q)         Gln, Glu, His, Lys, Asn, Thr, Arg
Asn (N)         Asn, Asp, Ser, Gln
Lys (K)         Lys, Arg, Glu, Gln, His
Asp (D)         Asp, Asn, Glu, Gln
Glu (E)         Glu, Gln, Asp, Lys, Asn, His, Arg
Met (M)         Met, Ile, Leu, Phe, Val

Table 4: Overview of the amino acid substitutions which could form the basis
of analogs (muteins) as defined above
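
The synonymous groups of Table 4 lend themselves to a simple lookup. The sketch below transcribes the table into one-letter codes and checks whether a candidate sequence of equal length differs from a reference only by substitutions listed for the reference residue. The function names are illustrative, and the treatment of residues absent from the table (here: only identity is accepted) is an assumption.

```python
# Table 4 in one-letter code: reference residue -> allowed residues.
TABLE4 = {
    "S": "STGN", "R": "RHKEQ", "L": "LIMFVY", "P": "PATG",
    "T": "TPSAGHQ", "A": "APGT", "V": "VMIYFL", "G": "GATPS",
    "I": "IMLFVY", "F": "FMYILWV", "Y": "YFWMIVL", "C": "CSTM",
    "H": "HQRKET", "Q": "QEHKNTR", "N": "NDSQ", "K": "KREQH",
    "D": "DNEQ", "E": "EQDKNHR", "M": "MILFV",
}


def is_allowed(reference: str, substitute: str) -> bool:
    """True if 'substitute' is in the synonymous group of 'reference'.
    Residues not listed in Table 4 only match themselves (assumption)."""
    return substitute in TABLE4.get(reference, reference)


def is_table4_analog(reference_seq: str, candidate_seq: str) -> bool:
    """True if both sequences have the same length and every position
    is either identical or a substitution allowed by Table 4."""
    if len(reference_seq) != len(candidate_seq):
        return False
    return all(is_allowed(r, c) for r, c in zip(reference_seq, candidate_seq))
```

Note that the lookup is directional: the group is read from the reference residue, so a substitution allowed in one direction is not automatically allowed in the other.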

The polypeptides of the invention, and particularly the fragments, can be
prepared by classical chemical synthesis.

The synthesis can be carried out in homogeneous solution or in solid phase.

For instance, the synthesis technique in homogeneous solution which can be
used is the one described by Houben-Weyl in the book entitled "Methoden der
organischen Chemie" (Methods of organic chemistry) edited by E. Wünsch, vol. 15-I and
II, THIEME, Stuttgart 1974.

The polypeptides of the invention can also be prepared in solid phase
according to the methods described by Atherton and Sheppard in their book entitled
"Solid phase peptide synthesis" (IRL Press, Oxford, 1989).

The polypeptides according to this invention can be prepared by means of
recombinant DNA techniques as described by Maniatis et al. (Molecular Cloning: A
Laboratory Manual, New York, Cold Spring Harbor Laboratory, 1982).

The present invention relates particularly to a polypeptide as defined above,
comprising in its amino acid sequence at least one of the following amino acid
residues:
I15, C38, V44, A49, Q43, P49, Q55, A58, S60 or D60, E68 or V68, H70, A71 or
Q71 or N71, D72, H81, H101, D106, S110, L130, I134, E135, L140, S148, T150
or E150, Q153, F155, D157, G160, E165, I169, F181, L186, T190, T192 or I192
or H192, I193, A195, S196, R197 or N197 or K197, Q199 or D199 or H199,
N199, F200 or T200, A208, I213, M216 or S216, N217 or S217 or G217 or K217,
T218, I219, A222, Y223, I230, W231 or L231, S232 or H232 or A232, Q233,
E235 or L235, F236 or T236, F237, L240 or M240, A242, N244, N249, I250 or
K250 or R250, A252 or C252, A254, I255 or V255, D256 or M256, E257, E260
or K260, R261, V268, S272 or R272, I285, G290 or F290, A291, A293 or L293
or W293, T294 or A294, S295, H295, K296 or E296, Y297 or M297, I299 or
Y299, I300, S301, P316, S2646, A2648, G2649, A2650, V2652, Q2653, H2656
or L2656, D2657, F2659, K2663 or Q2663, A2667 or V2667, D2677, L2681,
M2686 or Q2686 or E2686, A2692 or K2692, H2697, I2707, L2708 or Y2708,
A2709, A2719 or M2719, F2727, T2728 or D2728, E2729, F2730 or Y2730,
I2741, I2745, V2746 or E2746 or L2746 or K2746, A2748, S2749 or P2749,
R2750, E2751, D2752 or N2752 or S2752 or T2752 or V2752 or I2752 or Q2752,
S2753 or D2753 or G2753, D2754, A2755, L2756 or Q2756, or R2757,

with said notation being composed of a letter representing the amino acid
residue by its one-letter code, and a number representing the amino acid numbering
according to Kato et al., 1990 as shown in Table 1 (see also the numbering in
Figures 2, 4 and 6),

or a part thereof which is unique to at least one of the HCV subtypes or types as
defined in Table 5, and which contains at least one amino acid differing from any of
the known HCV types or subtypes, or an analog thereof being substantially
homologous and biologically equivalent to said polypeptide or part thereof.
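
The residue notation described above (a one-letter code followed by a position number) can be parsed mechanically. The following sketch is illustrative only: the function names are hypothetical, and the caller is assumed to know the polyprotein position of the first residue of the fragment being examined.

```python
import re
from typing import Tuple

# One uppercase letter (amino acid) followed by a position number.
_RESIDUE_TAG = re.compile(r"^([A-Z])(\d+)$")


def parse_residue(tag: str) -> Tuple[str, int]:
    """Split a tag such as 'T190' into ('T', 190)."""
    match = _RESIDUE_TAG.match(tag.strip())
    if match is None:
        raise ValueError(f"not a residue tag: {tag!r}")
    return match.group(1), int(match.group(2))


def has_residue(fragment: str, first_position: int, tag: str) -> bool:
    """True if 'fragment', whose first residue sits at polyprotein
    position 'first_position', carries the residue named by 'tag'."""
    amino_acid, position = parse_residue(tag)
    index = position - first_position
    return 0 <= index < len(fragment) and fragment[index] == amino_acid
```

For instance, a fragment HEVRNASGVYHV whose first residue is taken to sit at position 192 carries H192 but not T192.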

These unique amino acid residues can be deduced from aligning the new HCV
amino acid sequences as given in Figure 3 to all known HCV sequences. An
alignment with the new sequences as represented in SEQ ID NO 1 to 106 is given
in for instance Figures 2, 4 and 6. It should be clear that the alignments given in
these figures may be completed with all known HCV sequences to illustrate that any
of the above-given unique residues is indeed unique for at least one of the new HCV
sequences of the present invention.

Within the group of unique and new amino acid residues of the present
invention, unique residues may be found which are specific for the following new
types (subtypes) of HCV according to the HCV classification system used in the
present invention: type 1 subtype 1d, 1e, 1f or 1g isolates; type 2 subtype 2e, 2f,
2g, 2h, 2i, 2k or 2l isolates; type 3 subtype 3g isolates; type 4 subtype 4k, 4l or 4m
isolates; type 7 subtype 7a, 7c or 7d isolates; type 9, type 10 or type 11 isolates.
In order to obtain these residues, the alignments given in Figures 2, 4 and 6 may be
used to deduce the type- and/or subtype-specificity of any of the unique residues
given above.

For example, T190 (detected in subtype 1d) refers to a threonine at position
190 (see Figure 2). In other sequences only a serine (S190) or exceptionally an
alanine (A190 in type 10a) can be detected.

The polypeptides according to this embodiment of the invention may optionally
be labelled, or attached to a solid substrate, or coupled to a carrier molecule
such as biotin, or mixed with a proper adjuvant, all as known in the art and according
to the intended use (diagnostic, therapeutic or prophylactic).

The present invention also relates to a polypeptide as defined above,
comprising in its amino acid sequence at least one of the sequences represented by
SEQ ID NO 107 to 207 as listed above, or a part thereof which is unique to at least
one of the HCV subtypes or types as defined in Table 5, or an analog thereof being
substantially homologous and biologically equivalent to said polypeptide or part
thereof.

The present invention relates also to a polypeptide having an amino acid
sequence as represented in any of SEQ ID NO 1 to 106, or a part thereof which is
unique to at least one of the HCV subtypes or types as defined in Table 5, or an
analog thereof being substantially homologous and biologically equivalent to said
polypeptide or part thereof.

The variable region in the core protein (V-CORE in Fig. 2) has been shown to
be useful for serotyping (Machida et al., 1992). The sequences of the present
invention of type 1 (subtypes 1d, 1e, 1f or 1g), type 2 (subtypes 2e, 2f, 2g, 2h, 2i,
2k and 2l), type 3 (subtype 3g), type 4 (subtypes 4k, 4l or 4m), type 7 (subtypes 7a,
7c and 7d), and types 9, 10 or 11 show type-specific features in this region. The
peptide from amino acid 68 to 78 (V-core region) shows the following unique
sequences for the sequences of the present invention (see Figure 2):

ARQSDGRSWAQ or ARRSEGRSWAQ as for subtype 1d (SEQ ID NO 107 and 108)
ERRPEGRSWAQ as for subtype 1e (SEQ ID NO 109)
ARRPEGRSWAQ as for subtype 1f (SEQ ID NO 110)
DRRTTGKSWGR as for subtype 2k (SEQ ID NO 111)
DRRATGRSWGR as for subtype 2e (SEQ ID NO 112)
DRRATGKSWGR as for subtype 2f (SEQ ID NO 113)
VRQPTGRSWGQ as for type 9 (SEQ ID NO 114)
VRHQTGRTWAQ as for subtype 7a and 7c (SEQ ID NO 115)
VRQNQGRTWAQ as for subtype 7d (SEQ ID NO 116)
ARRTEGRSWAQ as for type 10 (SEQ ID NO 117)
VRRTTGRXXXX or VRRTTGRTWAQ as for type 11 (SEQ ID NO 118 and 119)

Five type-specific variable regions (V1 to V5) can be identified after aligning
E1 amino acid sequences of the genotypes of the present invention to the genotypes
already known, as shown in Figure 2.

Region V1 encompasses amino acids 192 to 203, this is the amino-terminal
10 amino acids of the E1 protein. The following unique
sequences as shown in Fig. 2 can be deduced:

HEVRNASGVYHV or HEVRNASGVYHL as for subtype 1d (SEQ ID NO 120 and 121)
YEVHSTTDGYHV as for subtype 1f (SEQ ID NO 122)
VEVKNTSQAYMA as for subtype 2e (SEQ ID NO 123)
IQVKNNSHFYMA as for subtype 2f (SEQ ID NO 124)
VQVKNTSTMYMA as for subtype 2g (SEQ ID NO 125)
VQVKNTSHSYMV as for subtype 2h (SEQ ID NO 126)
VQVANRSGSYMV as for subtype 2i (SEQ ID NO 127)
VEIKNTXNTYVL or VEIKNTSNTYVL as for subtype 2k (SEQ ID NO 128 and 129)
INYRNVSGIYYV or INYRNTSGIYHV or INYHNTSGIYHI or TNYRNVSGIYHV as
for subtype 4k (SEQ ID NO 130, 131, 132 or 133)
QHYRNVSGIYHV as for subtype 4l (SEQ ID NO 134)
IQVKNASGIYHL as for type 9 (SEQ ID NO 135)
AHYTNKSGLYHL as for subtype 7c (SEQ ID NO 136)
LNYANKSGLYHL as for subtype 7d (SEQ ID NO 137)
LEYRNASGLYMV as for type 10 (SEQ ID NO 138)

Region V2 encompasses amino acids 213 to 223. The following unique
sequences can be found in the V2 region as shown in Figure 2:

IYEMDGMIMHY or IYEMSGMILHA as for subtype 1d (SEQ ID NO 139 and 140)
VYEAKDIILHT as for subtype 1f (SEQ ID NO 141)
VWQLXDAVLHV as for subtype 2e (SEQ ID NO 142)
VWQLRDAVLHV as for subtype 2f (SEQ ID NO 143)
IWQMQGAVLHV as for subtype 2g (SEQ ID NO 144)
VWQLKDAVLHV as for subtype 2h (SEQ ID NO 145)
VWQLEEAVLHV as for subtype 2i (SEQ ID NO 146)
TWQLXXAVLHV as for subtype 2k (SEQ ID NO 147)
VYEADHHILHL or VYEADHHILAL or VFEADHHILHL as for subtype 4k
(SEQ ID NO 148, 149 and 150)
VYESDHHILHL as for subtype 4l (SEQ ID NO 151)
VFEAETMILHL as for type 9 (SEQ ID NO 152)
VYEAETLILHL as for subtype 7c (SEQ ID NO 153)
VYEANGMILHL as for subtype 7d (SEQ ID NO 154)
VYEAGDIILHL as for type 10 (SEQ ID NO 155)

Region V3 encompasses the amino acids 230 to 242. The following unique
V3 region sequences can be deduced from Figure 2:

VREDNHLRCWMAL or VRENNSSRCWMAL as for subtype 1d (SEQ ID NO 156 and 157)
IREGNISRCWVLP as for subtype 1f (SEQ ID NO 158)
ENSSGRFHCWIPI as for subtype 2e (SEQ ID NO 159)
ERSGNRTFCWTAV as for subtype 2f (SEQ ID NO 160)
ELQGNKSRCWIPV as for subtype 2g (SEQ ID NO 162)
ERHQNQSRCWIPV as for subtype 2h (SEQ ID NO 163)
EWKDNTSRCWIPV as for subtype 2i (SEQ ID NO 164)
EREGNSSRCWIPV as for subtype 2k (SEQ ID NO 165)
VREGNQSRCWVAL or VRTGNQSRCWVAL or VRVGNQSSCWVAL or
VRVGNQSRCWVAL or VKEGNHSRCWVAL as for subtype 4k
(SEQ ID NO 166, 167, 168 or 169)
VKTGNTSRCWVAL as for subtype 4l (SEQ ID NO 170)
IKAGNESRCWLPV as for type 9 (SEQ ID NO 171)
VKXXNQSRCWVQA as for subtype 7c (SEQ ID NO 172)
VKTGNLTKCWLSA as for subtype 7d (SEQ ID NO 173)
VRSGNTSRCWIPV as for type 10 (SEQ ID NO 174)

Region V4 encompasses the amino acids 248 to 257. The following unique
V4 region sequences can be deduced from Figure 2:

VKNASVPTAA or VKDANVPTAA as for subtype 1d (SEQ ID NO 175 and 176)
ARIANAPIDE as for subtype 1f (SEQ ID NO 177)
VSKPGALTKG as for subtype 2e (SEQ ID NO 178)
VSRPGALTRG as for subtype 2f (SEQ ID NO 179)
VNQPGALTRG as for subtype 2g (SEQ ID NO 180)
VSQPGALTRG as for subtype 2h (SEQ ID NO 181)
VSQPGALTKG as for subtype 2i (SEQ ID NO 182)
VSRPGALTEG as for subtype 2k (SEQ ID NO 183)
APYIGAPLES or APYTAAPLES as for subtype 4k (SEQ ID NO 184 and 185)
APILSAPLMS as for subtype 4l (SEQ ID NO 186)
VPNSSVPIHG as for type 9 (SEQ ID NO 187)
VPNASTPVTG as for subtype 7c (SEQ ID NO 188)
VQNASVSIRG as for subtype 7d (SEQ ID NO 189)
VKSPCAATAS as for type 10 (SEQ ID NO 190)

Region V5 encompasses the amino acids 294 to 303. The following unique
V5 region peptides can be deduced from Figure 2:

SPRMHHTTQE or SPRLYHTTQE as for subtype 1d (SEQ ID NO 191 and 192)
TSRRHWTVQD as for subtype 1f (SEQ ID NO 193)
APKRHYFVQE as for subtype 2e (SEQ ID NO 194)
SPQYHTFVQE as for subtype 2f (SEQ ID NO 195)
SPQHHNFSQD as for subtype 2g (SEQ ID NO 196)
SPQHHIFVQD as for subtype 2h (SEQ ID NO 197)
SPEHHHFVQD as for subtype 2k (SEQ ID NO 198)
RPRRHWTTQD or RPRRHWTAQD or QPRRHWTTQD or RPRRHWTTQE as for
subtype 4k (SEQ ID NO 199, 200, 201 or 202)
QPRRHWTVQD as for subtype 4l (SEQ ID NO 203)
RPKYHQVTQD as for type 9 (SEQ ID NO 204)
RPRMHQVVQE as for subtype 7c (SEQ ID NO 205)
RPRMYEIAQD as for subtype 7d (SEQ ID NO 206)
RHRQHWTVQD as for type 10 (SEQ ID NO 207)

The above-given peptides are particularly useful for treatment and for
vaccine and diagnostic development.
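
The region boundaries stated above (V-core 68 to 78, V1 192 to 203, V2 213 to 223, V3 230 to 242, V4 248 to 257, V5 294 to 303) can be used to cut the corresponding peptide out of a longer sequence. The sketch below is a minimal illustration; the function names are hypothetical, and the caller is assumed to supply the polyprotein position of the first residue of the sequence.

```python
# Inclusive polyprotein coordinates of the variable regions as stated in the text.
VARIABLE_REGIONS = {
    "V-core": (68, 78),
    "V1": (192, 203),
    "V2": (213, 223),
    "V3": (230, 242),
    "V4": (248, 257),
    "V5": (294, 303),
}


def region_length(name: str) -> int:
    """Number of residues in the named region (inclusive coordinates)."""
    start, end = VARIABLE_REGIONS[name]
    return end - start + 1


def extract_region(sequence: str, first_position: int, name: str) -> str:
    """Return the residues of the named region from 'sequence', whose
    first residue sits at polyprotein position 'first_position'."""
    start, end = VARIABLE_REGIONS[name]
    low = start - first_position
    high = end - first_position + 1
    if low < 0 or high > len(sequence):
        raise ValueError(f"sequence does not cover region {name}")
    return sequence[low:high]
```

The region lengths obtained this way (11, 12, 11, 13, 10 and 10 residues) agree with the lengths of the peptides listed for each region.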

Also comprised in the present invention is any synthetic peptide (see below)
or polypeptide containing at least an epitope derived from the above-defined peptides
in their peptidic chain. Also comprised within the present invention is any synthetic
peptide or polypeptide comprising at least 6, 7, 8, or 9 contiguous amino acids
derived from the above-defined peptides in their peptidic chain.
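
To make the notion of "at least 6, 7, 8, or 9 contiguous amino acids" concrete, the following sketch enumerates the contiguous fragments of a peptide and tests whether a candidate sequence contains one of them. The function names and the default window of 6 to 9 residues are illustrative choices, and exact string matching is assumed (placeholder residues such as X are treated as ordinary characters).

```python
from typing import List


def contiguous_fragments(peptide: str, min_len: int = 6, max_len: int = 9) -> List[str]:
    """List every contiguous fragment of 'peptide' whose length lies
    between min_len and max_len (inclusive)."""
    fragments = []
    for length in range(min_len, max_len + 1):
        for start in range(len(peptide) - length + 1):
            fragments.append(peptide[start:start + length])
    return fragments


def contains_fragment(candidate: str, peptide: str, min_len: int = 6) -> bool:
    """True if 'candidate' contains at least min_len contiguous residues
    of 'peptide'. Checking windows of exactly min_len is sufficient,
    since any longer shared stretch includes such a window."""
    return any(
        peptide[start:start + min_len] in candidate
        for start in range(len(peptide) - min_len + 1)
    )
```

For a 10-residue peptide this yields 5 + 4 + 3 + 2 = 14 fragments of 6 to 9 residues.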

As used herein, 'epitope' or 'antigenic determinant' means an amino
acid sequence that is immunoreactive. Generally an epitope consists of at least 3 to
4 amino acids, and more usually of at least 5 or 6 amino acids; sometimes
the epitope consists of about 7 to 8, or even about 10 amino acids.

The present invention particularly relates to any peptide (see below) or
polypeptide contained in any of the amino acid sequences as represented in SEQ ID
NO 2, 4, 7, 9, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44,
46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84,
86, 88, 90, 92, 94, 96, 98, 100, 102, 104 or 106 (see Table 5 and Figure 3,
Examples section).

The present invention also relates to a recombinant polypeptide encoded by
a polynucleic acid as defined above, or a part thereof which is unique to any of the
HCV subtypes or types as defined in Table 5, or an analog thereof being substantially
homologous and biologically equivalent to said polypeptide.

The present invention also relates to a recombinant expression vector
comprising a polynucleic acid or a part thereof as defined above, operably linked to
prokaryotic, eukaryotic or viral transcription and translation control elements.

In general said recombinant vector will comprise a vector sequence, an
appropriate prokaryotic, eukaryotic or viral promoter sequence followed by the
nucleotide sequences as defined above, with said recombinant vector allowing the
expression of any one of the polypeptides as defined above in a prokaryotic or
eukaryotic host or in living mammals when injected as naked DNA, and more
particularly a recombinant vector allowing the expression of any of the new HCV
sequences of the invention spanning particularly the following amino acid positions:

- a polypeptide starting in the region between positions 1 and 10 and ending
at any position in the region between positions 70 and 420, more
particularly a polypeptide spanning positions 1 to 70, positions 1 to 85,
positions 1 to 120, positions 1 to 150, positions 1 to 191, or positions 1 to
200, for expression of the Core protein, and a polypeptide spanning positions
1 to 263, positions 1 to 326, positions 1 to 383, or positions 1 to 420 for
expression of the Core and E1 protein;

- a polypeptide starting at any position in the region between positions 117
and 192, and ending at any position in the region between positions 263
and 420, for expression of E1, or forms that have the hydrophobic region
deleted (positions 264 to 293 plus or minus 8 amino acids);

- a polypeptide starting at any position in the region between positions 1556
and 1688, and ending at any position in the region between positions 1739
and 1764, for expression of NS4, more particularly a polypeptide starting
at position 1658 and ending at position 1711, for expression of NS4a
antigen, and more particularly, a polypeptide starting at position 1712 and
ending in the region between positions 1743 and 1972 (for instance 1712-
1743, 1712-1764, 1712-1782, 1712-1972, 1712-1782, 1712-1902), for
expression of NS4b antigen or parts thereof.
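
The position ranges listed above translate directly into slices of a polyprotein sequence. The sketch below illustrates this with 1-based, inclusive coordinates; the function names are hypothetical, and the example deletion of positions 264 to 293 simply takes the central values of the hydrophobic region given in the text.

```python
def span(polyprotein: str, start: int, end: int) -> str:
    """Return residues start..end of the polyprotein
    (1-based, inclusive coordinates)."""
    if not 1 <= start <= end <= len(polyprotein):
        raise ValueError("positions outside the polyprotein")
    return polyprotein[start - 1:end]


def span_with_deletion(polyprotein: str, start: int, end: int,
                       del_start: int, del_end: int) -> str:
    """Return residues start..end with the internal stretch
    del_start..del_end removed (all coordinates 1-based, inclusive).
    The deleted stretch must lie strictly inside the span."""
    if not start < del_start <= del_end < end:
        raise ValueError("deletion must lie strictly inside the span")
    return (span(polyprotein, start, del_start - 1)
            + span(polyprotein, del_end + 1, end))
```

With a polyprotein of at least 326 residues, `span(seq, 1, 191)` would give a 191-residue Core fragment, and `span_with_deletion(seq, 192, 326, 264, 293)` an E1 fragment of 105 residues lacking the hydrophobic region.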

Any other HCV vector construction known in the art may also be used for the 
recombinant polypeptides of the present invention. 

Any of the known purification methods for recombinant proteins may also be
used for the production of the recombinant polypeptides of the present invention,
particularly the HCV recombinant polypeptide purification methods as disclosed in
PCT/EP 95/03031 in the name of Innogenetics N.V.

The term "vector" may comprise a plasmid, a cosmid, a phage, or a virus or
a transgenic animal. Particularly useful for vaccine development may be BCG or
adenoviral vectors, as well as avipox recombinant viruses.

The present invention also relates to a method for the production of a
recombinant polypeptide as defined above, comprising:

- transformation of an appropriate cellular host with a recombinant vector, in
which a polynucleic acid or a part thereof as defined above has
been inserted under the control of appropriate regulatory elements,

- culturing said transformed cellular host under conditions enabling the
expression of said insert, and,

- harvesting said polypeptide.

The term 'recombinantly expressed' used within the context of the present
invention refers to the fact that the proteins of the present invention are produced
by recombinant expression methods, be it in prokaryotes, or lower or higher
eukaryotes as discussed in detail below.

The term 'lower eukaryote' refers to host cells such as yeast, fungi and the
like. Lower eukaryotes are generally (but not necessarily) unicellular. Preferred lower
eukaryotes are yeasts, particularly species within Saccharomyces,
Schizosaccharomyces, Kluyveromyces, Pichia (e.g. Pichia pastoris), Hansenula (e.g.
Hansenula polymorpha), Yarrowia, Schwanniomyces,
Zygosaccharomyces and the like. Saccharomyces cerevisiae, S. carlsbergensis and
K. lactis are the most commonly used yeast hosts, and are convenient fungal hosts.

The term 'prokaryotes' refers to hosts such as E. coli, Lactobacillus,
Lactococcus, Salmonella, Streptococcus, Bacillus subtilis or Streptomyces. These
hosts are also contemplated within the present invention.

The term 'higher eukaryote' refers to host cells derived from higher animals,
such as mammals, reptiles, insects, and the like. Presently preferred higher eukaryote
host cells are derived from Chinese hamster (e.g. CHO), monkey (e.g. COS and Vero
cells), baby hamster kidney (BHK), pig kidney (PK15), rabbit kidney 13 cells (RK13),
the human osteosarcoma cell line 143 B, the human cell line HeLa and human
hepatoma cell lines like Hep G2, and insect cell lines (e.g. Spodoptera frugiperda).
The host cells may be provided in suspension or flask cultures, tissue cultures, organ
cultures and the like. Alternatively the host cells may also be transgenic animals.

The term 'recombinant polynucleotide or nucleic acid' intends a polynucleotide
or nucleic acid of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue
of its origin or manipulation: (1) is not associated with all or a portion of a
polynucleotide with which it is associated in nature, (2) is linked to a polynucleotide
other than that to which it is linked in nature, or (3) does not occur in nature.

The terms 'recombinant host cells', 'host cells', 'cells', 'cell lines', 'cell
cultures', and other such terms denoting microorganisms or higher eukaryotic cell
lines cultured as unicellular entities refer to cells which can be, or have been, used
as recipients for a recombinant vector or other transfer polynucleotide, and include 
the progeny of the original cell which has been transfected. It is understood that the 
progeny of a single parental cell may not necessarily be completely identical in 
morphology or in genomic or total DNA complement as the original parent, due to 
natural, accidental, or deliberate mutation. 

The term 'replicon' is any genetic element, e.g., a plasmid, a chromosome, a 
virus, a cosmid, etc., that behaves as an autonomous unit of polynucleotide 
replication within a cell; i.e., capable of replication under its own control. 

The term 'vector' is a replicon further comprising sequences providing 
replication and/or expression of a desired open reading frame. 

The term 'control sequence' refers to polynucleotide sequences which are 
necessary to effect the expression of coding sequences to which they are ligated. 
The nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosomal binding 
site, splicing sites and terminators; in eukaryotes, generally, such control sequences 
include promoters, splicing sites, terminators and, in some instances, enhancers. The
term 'control sequences' is intended to include, at a minimum, all components whose
presence is necessary for expression, and may also include additional components 
whose presence is advantageous, for example, leader sequences which govern 
secretion. 

The term 'promoter' is a nucleotide sequence which is comprised of consensus 
sequences which allow the binding of RNA polymerase to the DNA template in a 
manner such that mRNA production initiates at the normal transcription initiation site
for the adjacent structural gene.

The expression 'operably linked' refers to a juxtaposition wherein the
components so described are in a relationship permitting them to function in their
intended manner. A control sequence 'operably linked' to a coding sequence is
ligated in such a way that expression of the coding sequence is achieved under
conditions compatible with the control sequences.

The segment of the HCV cDNA encoding the desired sequence inserted into
the vector sequence may be attached to a signal sequence. Said signal sequence
may be that from a non-HCV source, e.g. the IgG or tissue plasminogen activator
(tpa) leader sequence for expression in mammalian cells, or the α-mating factor
sequence for expression in yeast cells, but particularly preferred constructs
according to the present invention contain signal sequences appearing in the HCV
genome before the respective start points of the proteins.

A variety of vectors may be used to obtain recombinant expression of HCV
single or specific oligomeric envelope proteins of the present invention. Lower
eukaryotes such as yeasts and glycosylation mutant strains are typically transformed
with plasmids, or are transformed with a recombinant virus. The vectors may
replicate within the host independently, or may integrate into the host cell genome.

Higher eukaryotes may be transformed with vectors, or may be infected with
a recombinant virus, for example a recombinant vaccinia virus. Techniques and
vectors for the insertion of foreign DNA into vaccinia virus are well known in the art,
and utilize, for example, homologous recombination. A wide variety of viral promoter
sequences, possibly terminator sequences and poly(A)-addition sequences, possibly
enhancer sequences and possibly amplification sequences, all required for
mammalian expression, are available in the art. Vaccinia is particularly preferred since
vaccinia halts the expression of host cell proteins. Vaccinia is also very much
preferred since it allows the expression of, for instance, E1 and E2 proteins of HCV in
cells or individuals which are immunized with the live recombinant vaccinia virus. For
vaccination of humans the avipox and Ankara Modified Virus (AMV) are particularly
useful vectors.

Also known are insect expression transfer vectors derived from the baculovirus
Autographa californica nuclear polyhedrosis virus (AcNPV), which is a
helper-independent viral expression vector. Expression vectors derived from this
system usually use the strong viral polyhedrin gene promoter to drive the expression of
heterologous genes. Different vectors as well as methods for the introduction of
heterologous DNA into the desired site of baculovirus are available to the man skilled
in the art for baculovirus expression. Different signals for posttranslational
modification recognized by insect cells are also known in the art.

The present invention also relates to a host cell transformed with a
recombinant vector as defined above.

The present invention also relates to a method for detecting antibodies to HCV
present in a biological sample, comprising:

(i) contacting the biological sample to be analysed for the presence of HCV with a
polypeptide as defined above,

(ii) detecting the immunological complex formed between said antibodies and said
polypeptide.

The present invention also relates to a method for HCV typing, comprising: 

(i) contacting the biological sample to be analysed for the presence of HCV with a 
polypeptide as defined above, 

(ii) detecting the immunological complex formed between said antibodies and said
polypeptide.

The present invention also relates to a diagnostic kit for use in detecting the 
presence of HCV, said kit comprising at least one polypeptide as defined above, with 
said polypeptide being preferably bound to a solid support. 

The present invention also relates to a diagnostic kit for HCV typing, said kit
comprising at least one polypeptide as defined above, with said polypeptide being
preferably bound to a solid support.

The present invention also relates to a diagnostic kit as defined above,
said kit comprising a range of said polypeptides which are attached to specific
locations on a solid substrate.

The present invention also relates to a diagnostic kit as defined above, wherein
said solid support is a membrane strip and said polypeptides are coupled to the
membrane in the form of parallel lines.

The immunoassay methods according to the present invention may utilize
antigens from the different domains of the new and unique polypeptide sequences
of the present invention that maintain linear (in case of peptides) and conformational
epitopes (in case of polypeptides) recognized by antibodies in the sera from
individuals infected with HCV. It is within the scope of the invention to use for
instance single or specific oligomeric antigens, dimeric antigens, as well as
combinations of single or specific oligomeric antigens. The HCV antigens of the
present invention may be employed in virtually any assay format that employs a
known antigen to detect antibodies. Of course, a format that denatures the HCV
conformational epitope should be avoided or adapted. A common feature of all of
these assays is that the antigen is contacted with the body component suspected
of containing HCV antibodies under conditions that permit the antigen to bind to any
such antibody present in the component. Such conditions will typically be physiologic
temperature, pH and ionic strength using an excess of antigen. The incubation of the
antigen with the specimen is followed by detection of immune complexes comprised
of the antigen.

Design of the immunoassays is subject to a great deal of variation, and many
formats are known in the art. Protocols may, for example, use solid supports, or
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide;
the labels may be, for example, enzymatic, fluorescent, chemiluminescent,
radioactive, or dye molecules. Assays which amplify the signals from the immune
complex are also known; examples of which are assays which utilize biotin and
avidin or streptavidin, and enzyme-labeled and mediated immunoassays, such as
ELISA assays.

The immunoassay may be, without limitation, in a heterogeneous or in a
homogeneous format, and of a standard or competitive type. In a heterogeneous
format, the polypeptide is typically bound to a solid matrix or support to facilitate
separation of the sample from the polypeptide after incubation. Examples of solid
supports that can be used are nitrocellulose (e.g., in membrane or microtiter well
form), polyvinyl chloride (e.g., in sheets or microtiter wells), polystyrene latex (e.g.,
in beads or microtiter plates), polyvinylidine fluoride (known as Immulon™),
diazotized paper, nylon membranes, activated beads, and Protein A beads. For
example, Dynatech Immulon™ 1 or Immulon™ 2 microtiter plates or 0.25 inch
polystyrene beads (Precision Plastic Ball) can be used in the heterogeneous format.
The solid support containing the antigenic polypeptides is typically washed after
separating it from the test sample, and prior to detection of bound antibodies. Both
standard and competitive formats are known in the art.

In a homogeneous format, the test sample is incubated with the combination
of antigens in solution. For example, it may be under conditions that will precipitate
any antigen-antibody complexes which are formed. Both standard and competitive
formats for these assays are known in the art.

In a standard format, the amount of HCV antibodies in the antibody-antigen 
complexes is directly monitored. This may be accomplished by determining whether 
labeled anti-xenogeneic (e.g. anti-human) antibodies which recognize an epitope on 
anti-HCV antibodies will bind due to complex formation. In a competitive format, the 
amount of HCV antibodies in the sample is deduced by monitoring the competitive 
effect on the binding of a known amount of labeled antibody (or other competing
ligand) in the complex.
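
The competitive readout described above can be illustrated with a short Python sketch. It is not part of the disclosed method: the function names, the percent-inhibition formula and the 50% cutoff are assumptions chosen for illustration. The idea shown is only that sample antibody lowers the signal produced by a known amount of labeled competitor, and that the size of the drop is used to infer the presence of HCV antibodies.

```python
def percent_inhibition(signal_with_sample: float, signal_without_sample: float) -> float:
    """Percentage by which the sample reduces binding of the labeled competitor.

    signal_without_sample is the reference signal obtained with the labeled
    antibody alone (hypothetical units, e.g. optical density).
    """
    if signal_without_sample <= 0:
        raise ValueError("reference signal must be positive")
    return 100.0 * (1.0 - signal_with_sample / signal_without_sample)


def is_reactive(signal_with_sample: float,
                signal_without_sample: float,
                cutoff_percent: float = 50.0) -> bool:
    """Call a sample reactive when inhibition reaches the cutoff.

    The 50% default is an illustrative assumption, not a value from the text.
    """
    return percent_inhibition(signal_with_sample, signal_without_sample) >= cutoff_percent
```

In a real assay the cutoff would be established from control sera rather than fixed in advance.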

Complexes formed comprising anti-HCV antibody (or in the case of 
competitive assays, the amount of competing antibody) are detected by any of a 
number of known techniques, depending on the format. For example, unlabeled HCV 
antibodies in the complex may be detected using a conjugate of anti-xenogeneic Ig 
complexed with a label (e.g. an enzyme label). 

In an immunoprecipitation or agglutination assay format the reaction between 
the HCV antigens and the antibody forms a network that precipitates from the 
solution or suspension and forms a visible layer or film of precipitate. If no anti-HCV 
antibody is present in the test specimen, no visible precipitate is formed. 

There currently exist three specific types of particle agglutination (PA) assays. 
These assays are used for the detection of antibodies to various antigens when 
coated to a support. One type of this assay is the hemagglutination assay using red 
blood cells (RBCs) that are sensitized by passively adsorbing antigen (or antibody) to 
the RBC. The addition of specific antigen antibodies present in the body component, 
if any, causes the RBCs coated with the purified antigen to agglutinate. 

To eliminate potential non-specific reactions in the hemagglutination assay, 
two artificial carriers may be used instead of RBC in the PA. The most common of 
these are latex particles. However, gelatin particles may also be used. The assays 
utilizing either of these carriers are based on passive agglutination of the particles 
coated with purified antigens. 

The HCV antigens of the present invention comprised of conformational 
epitopes will typically be packaged in the form of a kit for use in these 
immunoassays. The kit will normally contain in separate containers the native HCV 
antigen, control antibody formulations (positive and/or negative), labeled antibody 



SUBSTITUTE SHEET (RULE 26) 



WO 96/13590 



PCT/EP95/04155



40 

when the assay format requires the same and signal generating reagents (e.g. 
enzyme substrate) if the label does not generate a signal directly. The native HCV 
antigen may be already bound to a solid matrix or separate with reagents for binding 
it to the matrix. Instructions (e.g. written, tape, CD-ROM, etc.) for carrying out the 
assay usually will be included in the kit.

Immunoassays that utilize the native HCV antigen are useful in screening blood
for the preparation of a supply from which potentially infective HCV is lacking. The
method for the preparation of the blood supply comprises the following steps:
reacting a body component, preferably blood or a blood component, from the
individual donating blood with HCV polypeptides of the present invention to allow
an immunological reaction between HCV antibodies, if any, and the HCV antigen;
and detecting whether anti-HCV antibody - HCV antigen complexes are formed as a
result of the reacting. Blood contributed to the blood supply is from donors that do
not exhibit antibodies to the native HCV antigens.

In cases of a positive reactivity to the HCV antigen, it is preferable to repeat
the immunoassay to lessen the possibility of false positives. For example, in the large
scale screening of blood for the production of blood products (e.g. blood transfusion,
plasma, Factor VIII, immunoglobulin, etc.) 'screening' tests are typically formatted
to increase sensitivity (to ensure no contaminated blood passes) at the expense of
specificity; i.e. the false-positive rate is increased. Thus, it is typical to only defer for
further testing those donors who are 'repeatedly reactive'; i.e. positive in two or
more runs of the immunoassay on the donated sample. However, for confirmation
of HCV-positivity, the 'confirmation' tests are typically formatted to increase
specificity (to ensure that no false-positive samples are confirmed) at the expense of
sensitivity.
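
The 'repeatedly reactive' rule described above can be written down as a minimal Python sketch. The function names and the disposition labels are hypothetical and serve only to make the decision logic explicit: a sample that is positive once is retested, and only a sample positive in two or more runs is deferred for further (confirmatory) testing.

```python
from typing import Sequence


def is_repeatedly_reactive(run_results: Sequence[bool], min_positive_runs: int = 2) -> bool:
    """True when the sample was positive in at least min_positive_runs runs."""
    return sum(1 for positive in run_results if positive) >= min_positive_runs


def donor_disposition(run_results: Sequence[bool]) -> str:
    """Map the screening runs on one donated sample to an illustrative disposition."""
    if not any(run_results):
        return "release"
    if is_repeatedly_reactive(run_results):
        return "defer for confirmatory testing"
    # A single positive run: repeat the immunoassay to lessen false positives.
    return "repeat immunoassay"
```

For example, `donor_disposition([True, False])` asks for a repeat, while `donor_disposition([True, True])` defers the donor for confirmation.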

The solid phase selected can include polymeric or glass beads, nitrocellulose,
microparticles, microwells of a reaction tray, test tubes and magnetic beads. The
signal generating compound can include an enzyme, a luminescent compound, a
chromogen, a radioactive element and a chemiluminescent compound. Examples of
enzymes include alkaline phosphatase, horseradish peroxidase and
beta-galactosidase. Examples of enhancer compounds include biotin, anti-biotin and
avidin. Examples of enhancer compound binding members include biotin, anti-biotin
and avidin. In order to block the effects of rheumatoid factor-like substances, the
test sample is subjected to conditions sufficient to block the effect of rheumatoid
factor-like substances. These conditions comprise contacting the test sample with
a quantity of anti-human IgG to form a mixture, and incubating the mixture for a time
and under conditions sufficient to form a reaction mixture product substantially free
of rheumatoid factor-like substance.

The present invention particularly relates to an immunoassay format in which 
the polypeptides (or peptides) of the invention are coupled to a membrane in the 
form of parallel lines. This assay format is particularly advantageous for HCV typing
purposes. 
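
How a parallel-line strip can serve typing purposes is sketched below in Python. The strip layout (which line carries a peptide specific for which type) is entirely hypothetical and is not taken from the text; the sketch only shows that, once each line position is assigned to a type-specific antigen, the set of reactive lines can be read off as the type or types present.

```python
from typing import Dict, Iterable, List

# Hypothetical layout: line position on the membrane strip -> HCV type whose
# type-specific peptide is coupled at that line. Not taken from the source.
STRIP_LAYOUT: Dict[int, str] = {1: "type 1", 2: "type 2", 3: "type 3"}


def infer_types(reactive_lines: Iterable[int],
                layout: Dict[int, str] = STRIP_LAYOUT) -> List[str]:
    """Return the sorted HCV types whose lines reacted with the sample."""
    return sorted({layout[line] for line in reactive_lines if line in layout})
```

A result listing more than one type would, under this simplified reading, point to cross-reactivity or a mixed infection and call for further analysis.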

The present invention also relates to a pharmaceutical composition comprising
at least one (recombinant) polypeptide as defined above and a suitable excipient,
diluent or carrier.

The present invention also relates to a method of preventing HCV infection,
comprising administering the pharmaceutical composition as defined above to a
mammal in an effective amount to stimulate the production of protective antibody or
protective T-cell response.

The present invention relates to the use of a composition as defined above in 
a method for preventing HCV infection. 

The present invention further relates to a vaccine for immunizing a mammal 
against HCV infection, comprising at least one (recombinant) polypeptide as defined
above, in a pharmaceutically acceptable carrier. 

The term 'immunogenic' refers to the ability of a substance to cause a 
humoral and/or cellular response, whether alone or when linked to a carrier, in the 
presence or absence of an adjuvant. 'Neutralization' refers to an immune response 
that blocks the infectivity, either partially or fully, of an infectious agent. A 'vaccine'
is an immunogenic composition capable of eliciting protection against HCV, whether 
partial or complete. A vaccine may also be useful for treatment of an individual, in 
which case it is called a therapeutic vaccine. 

The term 'therapeutic' refers to a composition capable of treating HCV
infection. The term 'effective amount' refers to an amount of epitope-bearing
polypeptide sufficient to induce an immunogenic response in the individual to which
it is administered, or to otherwise detectably immunoreact in its intended system
(e.g., immunoassay). Preferably, the effective amount is sufficient to effect
treatment, as defined above. The exact amount necessary will vary according to the
application. For vaccine applications or for the generation of polyclonal antiserum /
antibodies, for example, the effective amount may vary depending on the species,
age, and general condition of the individual, the severity of the condition being
treated, the particular polypeptide selected and its mode of administration, etc. It is
also believed that effective amounts will be found within a relatively large,
non-critical range. An appropriate effective amount can be readily determined using only
routine experimentation. Preferred ranges of proteins for prophylaxis of HCV disease
are 0.01 to 100 µg/dose, preferably 0.1 to 50 µg/dose. Several doses may be
needed per individual in order to achieve a sufficient immune response and
subsequent protection against HCV disease.
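
The two nested dose ranges stated above for prophylaxis (0.01 to 100 µg/dose, preferably 0.1 to 50 µg/dose) can be captured in a small Python sketch. The function name and the returned labels are illustrative assumptions; only the numeric bounds come from the text, and the sketch says nothing about what amount is actually effective for a given individual.

```python
# Bounds in micrograms of protein per dose, as stated for prophylaxis of HCV disease.
BROAD_RANGE_UG = (0.01, 100.0)
PREFERRED_RANGE_UG = (0.1, 50.0)


def classify_dose(dose_ug: float) -> str:
    """Classify a per-dose protein amount against the stated ranges."""
    if PREFERRED_RANGE_UG[0] <= dose_ug <= PREFERRED_RANGE_UG[1]:
        return "preferred"
    if BROAD_RANGE_UG[0] <= dose_ug <= BROAD_RANGE_UG[1]:
        return "within stated range"
    return "outside stated range"
```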

The present invention also relates to a vaccine as defined above, comprising
at least one (recombinant) polypeptide as defined above, with said polypeptide being
unique for at least one of the subtypes or types as defined above.

Said vaccine compositions may include prophylactic as well as therapeutic
vaccine compositions.

Pharmaceutically acceptable carriers include any carrier that does not itself
induce the production of antibodies harmful to the individual receiving the
composition. Suitable carriers are typically large, slowly metabolized macromolecules
such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric
amino acids, amino acid copolymers, and inactive virus particles. Such carriers are
well known to those of ordinary skill in the art.

Preferred adjuvants to enhance effectiveness of the composition include, but
are not limited to: aluminium hydroxide (alum),
N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP) as found in U.S. Patent
No. 4,606,918, N-acetyl-normuramyl-L-alanyl-D-isoglutamine (nor-MDP),
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(MTP-PE) and RIBI, which contains three components extracted from bacteria,
monophosphoryl lipid A, trehalose dimycolate, and cell wall skeleton
(MPL + TDM + CWS) in a 2% squalene/Tween 80 emulsion. Any of the 3 components
MPL, TDM or CWS may also be used alone or combined 2 by 2. Additionally,
adjuvants such as Stimulon (Cambridge Bioscience, Worcester, MA)

Immunogenic compositions used as vaccines comprise a 'sufficient amount'
or 'an immunologically effective amount' of the proteins of the present invention, as
well as any other of the above mentioned components, as needed. 'Immunologically
effective amount' means that the administration of that amount to an individual,
either in a single dose or as part of a series, is effective for treatment, as defined
above. This amount varies depending upon the health and physical condition of the
individual to be treated, the taxonomic group of individual to be treated (e.g.
nonhuman primate, primate, etc.), the capacity of the individual's immune system
to synthesize antibodies, the degree of protection desired, the formulation of the
vaccine, the treating doctor's assessment of the medical situation, the strain of
infecting HCV, and other relevant factors. It is expected that the amount will fall in
a relatively broad range that can be determined through routine trials. Usually, the
amount will vary from 0.01 to 1000 µg/dose, more particularly from 0.1 to 100
µg/dose.

The proteins of the invention may also serve as vaccine carriers to present
homologous (e.g. T cell epitopes or B cell epitopes from, for instance, the core, E1, E2,
NS2, NS3, NS4 or NS5 regions) or heterologous (non-HCV) haptens, in the same
manner as Hepatitis B surface antigen (see European Patent Application 174,444).
In this use, envelope proteins provide an immunogenic carrier capable of stimulating
an immune response to haptens or antigens conjugated to the aggregate. The antigen
may be conjugated either by conventional chemical methods, or may be cloned into
the gene encoding E1 and/or E2 at a location corresponding to a hydrophilic region
of the protein. Such hydrophilic regions include the V1 region (encompassing amino
acid positions 191 to 202), the V2 region (encompassing amino acid positions 213
to 223), the V3 region (encompassing amino acid positions 230 to 242), the V4
region (encompassing amino acid positions 230 to 242), the V5 region
(encompassing amino acid positions 294 to 303) and the V6 region (encompassing
amino acid positions 329 to 336). Another useful location for insertion of haptens
is the hydrophobic region (encompassing approximately amino acid positions 264 to
293). It is shown in the present invention that this region can be deleted without
affecting the reactivity of the deleted E1 protein with antisera. Therefore, haptens
may be inserted at the site of the deletion.
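
As a reading aid, the candidate insertion regions listed above can be held in a small lookup table. The Python sketch below is illustrative only: the names `E1_REGIONS` and `regions_at` are hypothetical, and the coordinates are copied from the text exactly as given, including the identical range stated for the V3 and V4 regions.

```python
from typing import Dict, List, Tuple

# Amino acid positions (inclusive) as stated in the text. The V3 and V4
# ranges are reproduced unchanged, although the text gives the same
# coordinates for both. The hydrophobic region is described as approximate.
E1_REGIONS: Dict[str, Tuple[int, int]] = {
    "V1": (191, 202),
    "V2": (213, 223),
    "V3": (230, 242),
    "V4": (230, 242),
    "V5": (294, 303),
    "V6": (329, 336),
    "hydrophobic": (264, 293),
}


def regions_at(position: int) -> List[str]:
    """Return the names of all listed regions that contain the given position."""
    return [name for name, (start, end) in E1_REGIONS.items() if start <= position <= end]
```

A position that falls in none of the listed regions returns an empty list, which under this sketch would mean it is not one of the insertion sites named in the text.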

The immunogenic compositions are conventionally administered parenterally,
typically by injection, for example, subcutaneously or intramuscularly. Additional
formulations suitable for other methods of administration include oral formulations
and suppositories. Dosage treatment may be a single dose schedule or a multiple
dose schedule. The vaccine may be administered in conjunction with other
immunoregulatory agents.

The administration of the immunogen(s) of the present invention may be for
either a prophylactic or therapeutic purpose. When provided prophylactically, the
immunogen(s) is provided in advance of any exposure to HCV or in advance of any
symptom due to HCV infection. The prophylactic administration
of the immunogen serves to prevent or attenuate any subsequent infection of HCV
in a mammal. When provided therapeutically, the immunogen(s) is provided at (or
shortly after) the onset of the infection or at the onset of any symptom of infection
or disease caused by HCV. The therapeutic administration of the immunogen(s)
serves to attenuate the infection or disease.

In addition to use as a vaccine, the compositions can be used to prepare
antibodies to HCV (E1) proteins. The antibodies can be used directly as antiviral
agents. To prepare antibodies, a host animal is immunized using the E1 proteins
native to the virus particle bound to a carrier as described above for vaccines. The
host serum or plasma is collected following an appropriate time interval to provide
a composition comprising antibodies reactive with the (E1) protein of the virus
particle. The gamma globulin fraction or the IgG antibodies can be obtained, for
example, by use of saturated ammonium sulfate or DEAE Sephadex, or other
techniques known to those skilled in the art. The antibodies are substantially free of
many of the adverse side effects which may be associated with other anti-viral
agents such as drugs.

The present invention also relates particularly to a peptide corresponding to
an amino acid sequence encoded by at least one of the HCV genomic sequences as
defined above, with said peptide being unique to any of the HCV subtypes or types
as defined in Table 5, and which contains at least one amino acid differing from any
of the known HCV types or subtypes, or an analog thereof being substantially
homologous and biologically equivalent.

The present invention relates particularly to a peptide comprising at least one
unique epitope of the new sequences of the invention as represented in SEQ ID NO
1 to 106.

The present invention relates also particularly to a peptide comprising in its
sequence a unique amino acid residue of the invention as defined above.

The present invention relates particularly to a peptide which is biotinylated as
explained in WO 93/18054.

All the embodiments (immunoassay formats, vaccines, compositions, uses,
etc.) illustrated for the polypeptides of the invention as above also relate to the
peptides of the invention.

The present invention also relates to a method for detecting antibodies to HCV
present in a biological sample, comprising:

(i) contacting the biological sample to be analysed for the presence of HCV with a
peptide as defined above,

(ii) detecting the immunological complex formed between said antibodies and said
peptide.

The present invention also relates to a method for HCV typing, comprising: 

(i) contacting the biological sample to be analysed for the presence of HCV with a 
peptide as defined above, 

(ii) detecting the immunological complex formed between said antibodies and said
peptide.

The present invention also relates to a diagnostic kit for use in detecting the
presence of HCV, said kit comprising at least one peptide as defined above, with said
peptide being preferably bound to a solid support.

The present invention also relates to a diagnostic kit for HCV typing, said kit
comprising at least one peptide as defined above, with said peptide being preferably
bound to a solid support.

The present invention also relates to a diagnostic kit as defined above, wherein 
said peptides are selected from the following:

- at least one NS4 peptide, 

- at least one NS4 peptide and at least one Core peptide, 

- at least one NS4 peptide and at least one Core peptide and at least one E1 peptide, 

- at least one NS4 peptide and at least one E1 peptide. 

The present invention also relates to a diagnostic kit as defined above, said
kit comprising a range of said peptides which are attached to specific locations on
a solid substrate.

The present invention also relates to a diagnostic kit as defined above, wherein 
said solid support is a membrane strip and said peptides are coupled to the 
membrane in the form of parallel lines.

The present invention also relates to a pharmaceutical composition comprising
at least one peptide as defined above and a suitable excipient, diluent or carrier.

The present invention also relates to a method of preventing HCV infection,
comprising administering the pharmaceutical composition as defined above to a
mammal in an effective amount to stimulate the production of protective antibody or
protective T-cell response.

The present invention also relates to the use of a composition as defined 
above in a method for preventing HCV infection. 

The present invention also relates to a vaccine for immunizing a mammal 
against HCV infection, comprising at least one peptide as defined above, in a
pharmaceutically acceptable carrier. 

The present invention relates also to a vaccine as defined above, comprising
at least one peptide as defined above, with said peptide being unique for at least one
of the subtypes or types as defined in Table 5.

The present invention relates to an antibody raised upon immunization with
at least one polypeptide or peptide as defined above, with said antibody being
specifically reactive with any of said polypeptides or peptides, and with said antibody
being preferably a monoclonal antibody.

The monoclonal antibodies of the invention can be produced by any
hybridoma liable to be formed according to classical methods from splenic cells of
an animal, particularly from a mouse or rat, immunized against the HCV polypeptides
according to the invention as defined above on the one hand, and of cells of a
myeloma cell line on the other hand, and to be selected by the ability of the
hybridoma to produce the monoclonal antibodies recognizing the polypeptides which
have been initially used for the immunization of the animals.

The antibodies involved in the invention can be labelled by an appropriate label 
of the enzymatic, fluorescent, or radioactive type. 

The monoclonal antibodies according to this preferred embodiment of the 
invention may be humanized versions of mouse monoclonal antibodies made by 
means of recombinant DNA technology, departing from parts of mouse and/or 
human genomic DNA sequences coding for H and L chains or from cDNA clones
coding for H and L chains.

Alternatively the monoclonal antibodies according to this preferred
embodiment of the invention may be human monoclonal antibodies. These
antibodies according to the present embodiment of the invention can also be derived
from human peripheral blood lymphocytes of patients infected with HCV type 1
subtype 1d, 1e, 1f or 1g; HCV type 2 subtype 2e, 2f, 2g, 2h, 2i, 2k or 2l; HCV type
3, subtype 3g; HCV type 4 subtype 4k, 4l or 4m; and/or HCV type 7 (subtypes 7a,
7c or 7d), 9, 10 or 11, or vaccinated against HCV. Such human monoclonal
antibodies are prepared, for instance, by means of human peripheral blood
lymphocytes (PBL) repopulation of severe combined immune deficiency (SCID) mice
(for recent review, see Duchosal et al. 1992) or by screening Epstein-Barr
virus-transformed lymphocytes of infected or vaccinated individuals for the presence of
reactive B-cells by means of the antigens of the present invention.

The invention also relates to the use of the proteins of the invention, muteins 
thereof, or peptides derived therefrom for the selection of recombinant antibodies by 
the process of repertoire cloning (Persson et al., 1991). 

Antibodies directed to peptides derived from a certain genotype may be used 
either for the detection of such HCV genotypes, or as therapeutic agents. 

The present invention relates also to a method for detecting HCV antigens
present in a biological sample, comprising:

(i) contacting said biological sample with an antibody as defined above,

(ii) detecting the immune complexes formed between said HCV antigens and said
antibody.

The present invention relates also to a method for HCV typing, comprising:

(i) contacting said biological sample with an antibody as defined above,

(ii) detecting the immune complexes formed between said HCV antigens and said
antibody.

The present invention relates also to a diagnostic kit for use in detecting the 
presence of HCV, said kit comprising at least one antibody as defined above, with 
said antibody being preferably bound to a solid support.

The present invention relates also to a diagnostic kit for HCV typing, said kit 
comprising at least one antibody as defined above, with said antibody being 
preferably bound to a solid support. 

The present invention relates also to a diagnostic kit as defined above, said 
kit comprising a range of said antibodies which are attached to specific locations on
a solid substrate. 

The present invention relates also to a pharmaceutical composition comprising 
at least one antibody as defined above and a suitable excipient, diluent or carrier.

The present invention relates also to a method of preventing or treating HCV
infection, comprising administering the pharmaceutical composition as defined above
to a mammal in an effective amount.

The present invention relates also to the use of a composition as defined 
above in a method for preventing or treating HCV infection. 

The genotype may also be detected by means of a type-specific antibody as 
defined above, which may also be linked to any polynucleotide sequence that can
afterwards be amplified by PCR to detect the immune complex formed (Immuno-PCR, 
Sano et al., 1992). 

Any publications or patent applications referred to herein are incorporated by 
reference. The following examples illustrate aspects of the invention but are in no 
way intended to limit the scope thereof.

Figure Legends

Figure 1 



Alignment of the nucleotide sequences of the Core/E1 region of some of the isolates 
of the newly identified types and subtypes of the present invention, with other 
known prototype isolates of subtypes. 



Figure 2 

Alignment of the amino acid sequences of the Core/E1 region of some of the isolates 
of the newly identified types and subtypes of the present invention, with other 
known prototype isolates of subtypes.

Figure 3 

Nucleotide and amino acid sequences obtained from the new HCV isolates of the 
present invention (SEQ ID NO 1 to 106). 

Figure 4 

Alignment of the amino acid sequences of the Core/E1 region of some of the isolates
of the newly identified types and subtypes of the present invention, with other 
known prototype isolates of subtypes. 

Figure 5 

Alignment of the nucleotide sequences of the NS5b region of some of the isolates 
of the newly identified types and subtypes of the present invention, with other
known prototype isolates of subtypes. 



SUBSTITUTE SHEET (RULE 26) 



WO 96/13590 



PCT/EP95/04155 




Figure 6 

Alignment of the amino acid sequences of the NS5b region of some of the isolates 
of the newly identified types and subtypes of the present invention, with other 
known prototype isolates of subtypes. 

Table 5

Overview of the new subtypes and types of the present invention and the regions
sequenced. The subtypes between brackets have been replaced by the
non-bracketed subtypes following the classification of Tokita et al. (1994).

Examples 

Serum samples.

Serum samples from Cameroonian blood donors (CAM) were screened for HCV 
antibodies with Innotest HCV Ab III, and confirmed by INNO-LIA HCV III 
(Innogenetics, Antwerp, Belgium). Serum samples from patients with chronic 
hepatitis C infection were obtained from various centers in the Benelux countries 
(BNL), from France (FR), from Pakistan (PAK), from Egypt (EG), and from Vietnam
(VN). 

Samples from the Benelux, Cameroon, France and Vietnam were selected 
because of their aberrant reactivities (isolates CAM1078, FR2, FR1, VN4, VN12, 
VN13, NE98 and others (see Table 5)). 

cPCR, LiPA, cloning and sequencing.

RNA isolation, cDNA synthesis, PCR, cloning, and LiPA genotyping using
biotinylated 5' UR amplification products were performed as described (Stuyver et
al., 1994c). The 5' UR, the Core/E1, and the NS5B PCR products were used for
direct sequencing. The sequences of the universal 5' UR primers HCPr95, HCPr96,
HCPr98, and HCPr29 were described previously (Stuyver et al. 1993b). The
following primers were also described (Stuyver et al. 1994c): HCPr41, a sense
primer for the amplification of the Core region; HCPr52 and HCPr54 for amplification
of the Core/E1 region; and HCPr206 and HCPr207 for amplification of a 340-bp NS5B
region.

Serum samples BNL1, BNL2, BNL3, BNL4, BNL5, BNL6, BNL7, BNL8, BNL9,
BNL10, BNL11, BNL12, CAM1078, FR2, FR16, FR4, FR13, VN13, VN4, VN12, FR1,
NE98, and FR19 were analyzed in the Core/E1 region by direct sequencing. Serum
samples BNL1, BNL2, FR17, CAM1078, FR2, FR16, BNL3, FR4, BNL5, FR13, FR18,
PAK64, BNL8, BNL12, EG81, VN13, VN4, VN12, FR1, NE98, FR14, FR15, and
FR19 were also analyzed in the NS5B region by direct sequencing. Partial 5' UR,
Core, E1, and NS5B sequences were obtained. The length of the obtained sequences
is sufficient to classify them into new types or subtypes, based
on the phylogenetic distances to known sequences. The following sequences could
be obtained (nucleotide sequences have odd-numbered SEQ ID NO., amino acid
sequences have even-numbered SEQ ID NO.): SEQ ID NO 1, 3, 5, 7, 9, 11, 13, 15,
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55,
57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95,
97, 99, 101, 103 and 105. The amino acid sequences deduced therefrom are given
in SEQ ID NO 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38,
40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78,
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104 and 106. Table 5 gives an
overview of these sequences.
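
The numbering convention stated above (an odd SEQ ID NO for each nucleotide sequence, and the next even SEQ ID NO for the amino acid sequence deduced from it) can be illustrated with a minimal sketch. The function name `paired_seq_ids` is hypothetical and is used here for illustration only; it is not part of the original disclosure.

```python
def paired_seq_ids(first=1, last=106):
    """Pair each odd (nucleotide) SEQ ID NO with the following even
    (deduced amino acid) SEQ ID NO, over the range first..last."""
    return [(n, n + 1) for n in range(first, last, 2)]


pairs = paired_seq_ids()
nucleotide_ids = [nt for nt, _ in pairs]   # 1, 3, 5, ..., 105
amino_acid_ids = [aa for _, aa in pairs]   # 2, 4, 6, ..., 106
```

Under this convention, the deduced amino acid sequence for nucleotide SEQ ID NO n is simply SEQ ID NO n + 1.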

Phylogenetic analysis.

Previously published sequences were taken from the EMBL/Genbank database. 
Alignments were created using the program HCVALIGN (Stuyver et al. 1994c). 
Sequences were presented in a sequential format to the Phylogeny Inference Package 
(PHYLIP) version 3.5c (public domain program freely available from the University of
Washington, Seattle, USA). Distance matrices were produced by DNADIST using the 
Kimura 2-parameter setting and further analyzed in NEIGHBOR, using the neighbor- 
joining setting. The program DRAWTREE was used to create graphic outputs. 
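
As an illustration of the distance step, the following minimal sketch computes the standard closed-form Kimura two-parameter estimate, d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitions and transversions between two aligned sequences. It is not the DNADIST source code, and DNADIST's own computation may differ in detail; the function names are hypothetical, and the handling of gaps and ambiguity codes (simply skipped) is an assumption.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def kimura_2p_distance(seq_a, seq_b):
    """Closed-form Kimura two-parameter distance between two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    a_norm = seq_a.upper().replace("U", "T")
    b_norm = seq_b.upper().replace("U", "T")
    transitions = transversions = compared = 0
    for a, b in zip(a_norm, b_norm):
        if a not in "ACGT" or b not in "ACGT":
            continue  # assumption: gaps and ambiguity codes are ignored
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1      # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1    # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable positions")
    p = transitions / compared
    q = transversions / compared
    if 1 - 2 * p - q <= 0 or 1 - 2 * q <= 0:
        raise ValueError("sequences too divergent for this estimate")
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def distance_matrix(aligned):
    """Pairwise distances for a mapping {isolate name: aligned sequence}."""
    names = list(aligned)
    return {
        (x, y): kimura_2p_distance(aligned[x], aligned[y])
        for i, x in enumerate(names)
        for y in names[i + 1:]
    }
```

A matrix of this kind is the input that a neighbor-joining program then turns into a tree.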

Identification of new subtypes 

These analyses indicated the clustering of BNL1, BNL2, CAM1078, FR2, FR16,
and FR17 with type 1 isolates, yet none of these sequences clustered together with
any of the known type 1 subtypes 1a, 1b, or 1c. BNL1, BNL2, and FR17 clearly
clustered together and could be assigned a new type 1 subtype 1d, while CAM1078
could be classified into another new subtype 1e, FR2 could be classified into another
type 1 subtype 1f, and FR16 could be classified into yet another type 1 subtype 1g.
Interestingly, all 3 type 1d isolates (BNL1, BNL2, and FR17) and 1g isolate FR16 were
obtained from patients of Moroccan ethnic origin who resided in Europe.

Another group of isolates showed homology to other type 2 sequences, but
none of the isolates BNL3, FR4, BNL4, BNL5, BNL6, FR13, or FR18 could be classified
into one of the known type 2 subtypes 2a, 2b, 2c (Bukh et al., 1993), or 2d (Stuyver
et al., 1994c). Based on the phylogenetic distances to other type 2 isolates and to
other isolates of the group, each of these isolates could be classified into a new type
2 subtype. BNL3 was assigned subtype 2e, FR4 subtype 2f, BNL4 subtype 2g, BNL5
subtype 2h, and BNL6 could be classified into yet another type 2 subtype 2i. If the
previously published isolate HN4 is classified as 2j, FR13 and FR18 may be classified
into new type 2 subtypes 2k and 2l. However, the possibility that FR13 and FR18
could belong to subtypes 2g or 2i has not yet been ruled out. Definite classification
can be obtained by determining the NS5B sequences of isolates BNL4 and BNL6,
belonging to subtypes 2g and 2i, respectively.

Isolate PAK64 showed homology to type 3 sequences, but could not be
classified into one of the known type 3 subtypes 3a to 3f. Based on the phylogenetic
distances to other type 3 isolates, PAK64 could be classified into a new type 3
subtype. PAK64 was assigned subtype 3g. However, the possibility that PAK64
belongs to a known type 3 subtype cannot be strictly ruled out since only one region
of the genome has been sequenced. Definite classification can be obtained by
determining the Core/E1 sequences of isolate PAK64 after amplification with
primers HCPr52 and HCPr54.

Among the Benelux and Egyptian samples that were analyzed, some sequences
clustered with the previously identified type 4 subtypes 4c and 4d. However, BNL7,
BNL8, BNL9, BNL10, BNL11, BNL12, and EG81 clustered into new subtypes of type
4. Isolates BNL7, BNL8, BNL9, BNL10, and BNL11 clustered again separately from
BNL12 and EG81 into a new subtype 4k. This subtype was the predominant subtype
in the Benelux countries. BNL12 and EG81 also segregated into separate subtypes.
BNL12 was assigned to another new subtype 4l and EG81 was assigned to yet
another new subtype 4m.

Identification of new HCV major types

Isolates FR1, VN4, VN12, VN13, NE98, FR14, FR15, and FR19 did not cluster
with any of the known 6 major types of HCV. VN4, VN12, and VN13 were very
distantly related to genotype 6, but phylogenetic analysis indicated that these isolates
should be assigned new major types. VN13, VN4 and VN12 were related at the
subtype level and assigned type 7a, 7c, and 7d, respectively. FR1 was not related to
any known isolate and was assigned genotype 9a. NE98 shows a distant relatedness
to type 3 sequences, yet phylogenetic analysis suggested classification into a new
major type 10a. Depending on international guidelines for assigning type and subtype
levels, NE98 may also be classified into an additional type 3 subtype. FR14, FR15,
and FR19 show a very distant relatedness to type 2 sequences, yet phylogenetic
analysis indicated these isolates to be classified into a new major type 11, all belonging
to the same subtype designated 11a. Depending on international guidelines for
assigning type and subtype levels, FR14, FR15, and FR19 may also be classified into
an additional type 2 subtype.
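
The dependence of the type-versus-subtype call on the chosen guidelines can be sketched as a simple cutoff rule on phylogenetic distances. This is an illustrative sketch only: the function name, the cutoff parameters and the numerical distances below are hypothetical, were not measured on any isolate, and are not taken from this disclosure or from any published guideline.

```python
import string


def classify_isolate(distances, subtype_cutoff, type_cutoff):
    """Classify an isolate from its distances to reference subtypes.

    distances maps a reference subtype label such as '3a' to the
    phylogenetic distance between the isolate and that reference.
    Both cutoffs are assumed to be supplied by whatever classification
    guideline is being applied.
    """
    nearest = min(distances, key=distances.get)
    d = distances[nearest]
    nearest_type = nearest.rstrip(string.ascii_lowercase)  # '3a' -> '3'
    if d < subtype_cutoff:
        return f"known subtype {nearest}"
    if d < type_cutoff:
        return f"new subtype of type {nearest_type}"
    return "new major type"


# Invented distances, for illustration only.
example = {"3a": 0.52, "1a": 0.70, "2a": 0.75}
strict = classify_isolate(example, subtype_cutoff=0.30, type_cutoff=0.50)
lenient = classify_isolate(example, subtype_cutoff=0.30, type_cutoff=0.60)
```

With the stricter type cutoff the hypothetical isolate is called a new major type, while the more lenient cutoff places it as a new subtype of the nearest existing type, which mirrors the two alternatives described above for NE98 and for FR14, FR15 and FR19.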



CLAIMS

1. An HCV polynucleic acid, having a nucleotide sequence which is unique to a 
theretofore unidentified HCV type or subtype which is different from HCV subtypes 
1a, 1b, 1c, 2a, 2b, 2c, 2d, 3a, 3b, 3c, 3d, 3e, 3f, 4a, 4b, 4c, 4d, 4e, 4f, 4g, 4h, 4i, 
4j, 5a or 6a, with said HCV subtypes being classified as in Table 3 by comparison of 
a part of the NS5 gene nucleotide sequence spanning positions 7932 to 8271, with 
said amino acid numbering being shown in Table 1, and with said polynucleic acid
containing at least one nucleotide differing from said known HCV nucleotide 
sequences, or the complement thereof. 

2. A polynucleic acid according to claim 1, having a nucleotide sequence which is
unique to at least one of HCV subtypes 1d, 1e, 1f, 1g, 2e, 2f, 2g, 2h, 2i, 2k, 2l, 3g,
4k, 4l, 4m, 7a, 7c or 7d, with said HCV subtypes being classified as defined in claim
1.

3. A polynucleic acid according to claim 1, having a nucleotide sequence which is
unique to at least one of HCV types 9, 10 or 11, with said HCV types being classified
as defined in claim 1.

4. A polynucleic acid according to any of claims 1 to 3 encoding an HCV polyprotein 
comprising in its amino acid sequence at least one of the following amino acid 
residues: 

I15, C38, V44, A49, Q43, P49, Q55, A58, S60 or D60, E68 or V68, H70, A71 or
Q71 or N71, D72, H81, H101, D106, S110, L130, I134, E135, L140, S148, T150
or E150, Q153, F155, D157, G160, E165, I169, F181, L186, T190, T192 or I192
or H192, I193, A195, S196, R197 or N197 or K197, Q199 or D199 or H199 or
N199, F200 or T200, A208, I213, M216 or S216, N217 or S217 or G217 or K217,
T218, I219, A222, Y223, I230, W231 or L231, S232 or H232 or A232, Q233, E235
or L235, F236 or T236, F237, L240 or M240, A242, N244, N249, I250 or K250 or
R250, A252 or C252, A254, I255 or V255, D256 or M256, E257, E260 or K260,
R261, V268, S272 or R272, I285, G290 or F290, A291, A293 or L293 or W293,
T294 or A294, S295 or H295, K296 or E296, Y297 or M297, I299 or Y299, I300,
S301, P316, S2646, A2648, G2649, A2650, V2652, Q2653, H2656 or L2656,
D2657, F2659, K2663 or Q2663, A2667 or V2667, D2677, L2681, M2686 or
Q2686 or E2686, A2692 or K2692, H2697, I2707, L2708 or Y2708, A2709, A2719
or M2719, F2727, T2728 or D2728, E2729, F2730 or Y2730, I2741, I2745, V2746
or E2746 or L2746 or K2746, A2748, S2749 or P2749, R2750, E2751, D2752 or
N2752 or S2752 or T2752 or V2752 or I2752 or Q2752, S2753 or D2753 or G2753,
D2754, A2755, L2756 or Q2756, R2757,

with said notation being composed of a letter representing the amino acid residue by
its one-letter code, and a number representing the amino acid numbering as shown in
Table 1,

or a part of said polynucleic acid which is unique to at least one of the HCV subtypes 
or types as defined in claims 2 to 3, and which contains at least one nucleotide 
differing from known HCV nucleotide sequences, or the complement thereof. 
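
The residue notation used in this claim (a one-letter amino acid code followed by a position number, with alternatives at the same position joined by "or") can be read mechanically, as in the following minimal sketch. The function names are hypothetical, and the sketch assumes that the positions of an observed sequence have already been numbered according to Table 1.

```python
import re
from collections import defaultdict

# One-letter amino acid code followed by a position number, e.g. "S60".
TOKEN = re.compile(r"^([ACDEFGHIKLMNPQRSTVWY])(\d+)$")


def parse_residue_list(text):
    """Parse text such as 'S60 or D60, E68 or V68, H70' into
    a mapping {position: set of listed residues}."""
    allowed = defaultdict(set)
    for entry in text.split(","):
        for token in re.split(r"\s+or\s+", entry.strip()):
            if not token:
                continue
            match = TOKEN.match(token)
            if match is None:
                raise ValueError(f"not in letter-plus-number notation: {token!r}")
            allowed[int(match.group(2))].add(match.group(1))
    return dict(allowed)


def listed_residues_present(observed, allowed):
    """Return the listed residues found in an observed sequence.

    observed maps a Table 1 position to the one-letter residue seen there.
    """
    return sorted(
        f"{aa}{pos}" for pos, aa in observed.items() if aa in allowed.get(pos, ())
    )


allowed = parse_residue_list("S60 or D60, E68 or V68, H70")
hits = listed_residues_present({60: "D", 68: "A", 70: "H"}, allowed)
```

In this small example the observed residues at positions 60 and 70 are among those listed, while the residue at position 68 is not.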

5. A polynucleic acid according to any of claims 1 to 4, with said polynucleic acid 
encoding a HCV polyprotein comprising in its amino acid sequence at least one amino
acid sequence chosen from the following list: 



ARQSDGRSWAQ or ARRSEGRSWAQ as for subtype 1d (SEQ ID NO 107 and 108)
ERRPEGRSWAQ as for subtype 1e (SEQ ID NO 109)
ARRPEGRSWAQ as for subtype 1f (SEQ ID NO 110)
DRRTTGKSWGR as for subtype 2k (SEQ ID NO 111)
DRRATGRSWGR as for subtype 2e (SEQ ID NO 112)
DRRATGKSWGR as for subtype 2f (SEQ ID NO 113)
VRQPTGRSWGQ as for type 9 (SEQ ID NO 114)
VRHQTGRTWAQ as for subtype 7a and 7c (SEQ ID NO 115)
VRQNQGRTWAQ as for subtype 7d (SEQ ID NO 116)
ARRTEGRSWAQ as for type 10 (SEQ ID NO 117)
VRRTTGRXXXX or VRRTTGRTWAQ as for type 11 (SEQ ID NO 118 and 119)

HEVRNASGVYHV or HEVRNASGVYHL as for subtype 1d (SEQ ID NO 120 and 121)
YEVHSTTDGYHV as for subtype 1f (SEQ ID NO 122)
VEVKNTSQAYMA as for subtype 2e (SEQ ID NO 123)
IQVKNNSHFYMA as for subtype 2f (SEQ ID NO 124)
VQVKNTSTMYMA as for subtype 2g (SEQ ID NO 125)
VQVKNTSHSYMV as for subtype 2h (SEQ ID NO 126)
VQVANRSGSYMV as for subtype 2i (SEQ ID NO 127)
VEIKNTXNTYVL or VEIKNTSNTYVL as for subtype 2k (SEQ ID NO 128 and 129)
INYRNVSGIYYV or INYRNTSGIYHV or INYHNTSGIYHI or TNYRNVSGIYHV as for
subtype 4k (SEQ ID NO 130, 131, 132 or 133)
QHYRNVSGIYHV as for subtype 4l (SEQ ID NO 134)
IQVKNASGIYHL as for type 9 (SEQ ID NO 135)
AHYTNKSGLYHL as for subtype 7c (SEQ ID NO 136)
LNYANKSGLYHL as for subtype 7d (SEQ ID NO 137)
LEYRNASGLYMV as for type 10 (SEQ ID NO 138)

IYEMDGMIMHY or IYEMSGMILHA as for subtype 1d (SEQ ID NO 139 and 140)
VYEAKDIILHT as for subtype 1f (SEQ ID NO 141)
VWQLXDAVLHV as for subtype 2e (SEQ ID NO 142)
VWQLRDAVLHV as for subtype 2f (SEQ ID NO 143)
IWQMQGAVLHV as for subtype 2g (SEQ ID NO 144)
VWQLKDAVLHV as for subtype 2h (SEQ ID NO 145)
VWQLEEAVLHV as for subtype 2i (SEQ ID NO 146)
TWQLXXAVLHV as for subtype 2k (SEQ ID NO 147)
VYEADHHILHL or VYEADHHILAL or VFEADHHILHL as for subtype 4k
(SEQ ID NO 148, 149 and 150)
VYESDHHILHL as for subtype 4l (SEQ ID NO 151)
VFEAETMILHL as for type 9 (SEQ ID NO 152)
VYEAETLILHL as for subtype 7c (SEQ ID NO 153)
VYEANGMILHL as for subtype 7d (SEQ ID NO 154)
VYEAGDIILHL as for type 10 (SEQ ID NO 155)

VREDNHLRCWMAL or VRENNSSRCWMAL as for subtype 1d (SEQ ID NO 156 and 157)
IREGNISRCWVPL as for subtype 1f (SEQ ID NO 158)
ENSSGRFHCWIPI as for subtype 2e (SEQ ID NO 159)
ERSGNRTFCWTAV as for subtype 2f (SEQ ID NO 160)
ELQGNKSRCWIPV as for subtype 2g (SEQ ID NO 162)
ERHQNQSRCWIPV as for subtype 2h (SEQ ID NO 163)
EWKDNTSRCWIPV as for subtype 2i (SEQ ID NO 164)
EREGNSSRCWIPV as for subtype 2k (SEQ ID NO 165)
VREGNQSRCWVAL or VRTGNQSRCWVAL or VRVGNQSSCWVAL or
VRVGNQSRCWVAL or VKEGNHSRCWVAL as for subtype 4k
(SEQ ID NO 166, 167, 168 or 169)
VKTGNTSRCWVAL as for subtype 4l (SEQ ID NO 170)
IKAGNESRCWLPV as for type 9 (SEQ ID NO 171)
VKEGNQSRCWVQA as for subtype 7c (SEQ ID NO 172)
VKXXNLTKCWLSA as for subtype 7d (SEQ ID NO 173)
VRSGNTSRCWIPV as for type 10 (SEQ ID NO 174)

VKNASVPTAA or VKDANVPTAA as for subtype 1d (SEQ ID NO 175 and 176)
ARIANAPIDE as for subtype 1f (SEQ ID NO 177)
VSKPGALTKG as for subtype 2e (SEQ ID NO 178)
VSRPGALTRG as for subtype 2f (SEQ ID NO 179)
VNQPGALTRG as for subtype 2g (SEQ ID NO 180)
VSQPGALTRG as for subtype 2h (SEQ ID NO 181)
VSQPGALTKG as for subtype 2i (SEQ ID NO 182)
VSRPGALTEG as for subtype 2k (SEQ ID NO 183)
APYIGAPLES or APYTAAPLES as for subtype 4k (SEQ ID NO 184 and 185)
APILSAPLMS as for subtype 4l (SEQ ID NO 186)
VPNSSVPIHG as for type 9 (SEQ ID NO 187)
VPNASTPVTG as for subtype 7c (SEQ ID NO 188)
VQNASVSIRG as for subtype 7d (SEQ ID NO 189)
VKSPCAATAS as for type 10 (SEQ ID NO 190)

SPRMHHTTQE or SPRLYHTTQE as for subtype 1d (SEQ ID NO 191 and 192)
TSRRHWTVQD as for subtype 1f (SEQ ID NO 193)
APKRHYFVQE as for subtype 2e (SEQ ID NO 194)
SPQYHTFVQE as for subtype 2f (SEQ ID NO 195)
SPQHHNFSQD as for subtype 2g (SEQ ID NO 196)
SPQHHIFVQD as for subtype 2h (SEQ ID NO 197)
SPEHHHFVQD as for subtype 2k (SEQ ID NO 198)
RPRRHWTTQD or RPRRHWTAQD or QPRRHWTTQD or RPRRHWTTQE as for
subtype 4k (SEQ ID NO 199, 200, 201 or 202)
QPRRHWTVQD as for subtype 4l (SEQ ID NO 203)
RPKYHQVTQD as for type 9 (SEQ ID NO 204)
RPRMHQVVQE as for subtype 7c (SEQ ID NO 205)
RPRMYEIAQD as for subtype 7d (SEQ ID NO 206)
RHRQHWTVQD as for type 10 (SEQ ID NO 207)



or a part of said polynucleic acid which is unique to at least one of the HCV subtypes 
or types as defined in claims 2 to 3, and which contains at least one nucleotide
differing from known HCV nucleotide sequences, or the complement thereof. 

6. A polynucleic acid according to any of claims 1 to 5 having a sequence selected 
from any of SEQ ID NO 1 to 105, or a part of said polynucleic acid which is unique 
to at least one of the HCV subtypes or types as defined in claims 2 to 3, and which 
contains at least one nucleotide differing from known HCV nucleotide sequences, or 
the complement thereof. 

7. A polynucleic acid according to any of claims 1 to 6, which codes for the 5' UR, 
the Core/E1, the NS4 or the NS5B region or a part thereof.

8. A polynucleic acid according to any of claims 1 to 7 which is a cDNA sequence. 

9. An oligonucleotide primer comprising part of a polynucleic acid according to any of 
claims 1 to 8, with said primer being able to act as primer for specifically amplifying 
the nucleic acid of a certain isolate belonging to the genotype from which the primer 
is derived. 

10. An oligonucleotide probe comprising part of a polynucleic acid according to any 
of claims 1 to 8, with said probe being able to act as a hybridization probe for specific 
detection and/or classification into types and/or subtypes of a HCV nucleic acid 
containing said nucleotide sequence, with said probe being possibly labelled or 
attached to a solid substrate. 

11. A diagnostic kit for use in determining the genotype of HCV, said kit comprising a
primer according to claim 9.

12. A diagnostic kit for use in determining the genotype of HCV, said kit comprising 
a probe according to claim 10. 

13. A diagnostic kit according to claim 12, wherein said probe(s) is(are) attached to 
a solid substrate. 

14. A diagnostic kit according to claim 13, wherein a range of said probes are 
attached to specific locations on a solid substrate. 

15. A diagnostic kit according to claim 14, wherein said solid support is a membrane 
strip and said probes are coupled to the membrane in the form of parallel lines. 

16. A method for the detection of HCV nucleic acids present in a biological sample, 
comprising: 

(i) possibly extracting sample nucleic acid, 

(ii) amplifying the nucleic acid with at least one primer according to claim 9, 

(iii) detecting the amplified nucleic acids. 

17. A method for the detection of HCV nucleic acids present in a biological sample, 
comprising: 

(i) possibly extracting sample nucleic acid, 

(ii) possibly amplifying the nucleic acid with at least one primer according to 
claim 9, or with a universal HCV primer,

(iii) hybridizing the nucleic acids of the biological sample, possibly under
denatured conditions, at appropriate conditions with one or more probes
according to claim 10, with said probes being possibly attached to a solid
substrate,

(iv) possibly washing at appropriate conditions, 

(v) detecting the hybrids formed. 

18. A method for detecting the presence of one or more HCV genotypes present in
a biological sample, comprising:

(i) possibly extracting sample nucleic acid, 

(ii) specifically amplifying the nucleic acid with at least one primer according to 
claim 9, 

(iii) detecting said amplified nucleic acids, 

(iv) inferring the presence of one or more genotypes of HCV present from the 
observed pattern of amplified fragments. 

19. A method for detecting the presence of one or more HCV genotypes present in 
a biological sample, comprising: 

(i) possibly extracting sample nucleic acid, 

(ii) possibly amplifying the nucleic acid with at least one primer according to 
claim 9 or with a universal HCV primer, 

(iii) hybridizing the nucleic acids of the biological sample, possibly under 
denatured conditions, at appropriate conditions with one or more probes 
according to claim 10, with said probes being possibly attached to a solid
substrate, 

(iv) possibly washing at appropriate conditions, 

(v) detecting the hybrids formed, 

(vi) inferring the presence of one or more HCV genotypes present from the 
observed hybridization pattern. 

20. A method according to claim 19, wherein said probes are further characterized as 
defined in any of claims 13 to 15.

21. A method according to claims 16 to 18, wherein said nucleic acids are labelled
during or after amplification. 

22. A polypeptide having an amino acid sequence encoded by a polynucleic acid 
according to any of claims 1 to 8, or a part thereof which is unique to at least one of 
the HCV subtypes or types as defined in claims 2 or 3, and which contains at least 
one amino acid differing from any of the known HCV types or subtypes amino acid 
sequences, or an analog thereof being substantially homologous and biologically
equivalent.

23. A polypeptide according to claim 22 comprising in its amino acid sequence at least 
one of the following amino acid residues: 

I15, C38, V44, A49, Q43, P49, Q55, A58, S60 or D60, E68 or V68, H70, A71 or
Q71 or N71, D72, H81, H101, D106, S110, L130, I134, E135, L140, S148, T150
or E150, Q153, F155, D157, G160, E165, I169, F181, L186, T190, T192 or I192
or H192, I193, A195, S196, R197 or N197 or K197, Q199 or D199 or H199 or
N199, F200 or T200, A208, I213, M216 or S216, N217 or S217 or G217 or K217,
T218, I219, A222, Y223, I230, W231 or L231, S232 or H232 or A232, Q233, E235
or L235, F236 or T236, F237, L240 or M240, A242, N244, N249, I250 or K250 or
R250, A252 or C252, A254, I255 or V255, D256 or M256, E257, E260 or K260,
R261, V268, S272 or R272, I285, G290 or F290, A291, A293 or L293 or W293,
T294 or A294, S295 or H295, K296 or E296, Y297 or M297, I299 or Y299, I300,
S301, P316, S2646, A2648, G2649, A2650, V2652, Q2653, H2656 or L2656,
D2657, F2659, K2663 or Q2663, A2667 or V2667, D2677, L2681, M2686 or
Q2686 or E2686, A2692 or K2692, H2697, I2707, L2708 or Y2708, A2709, A2719
or M2719, F2727, T2728 or D2728, E2729, F2730 or Y2730, I2741, I2745, V2746
or E2746 or L2746 or K2746, A2748, S2749 or P2749, R2750, E2751, D2752 or
N2752 or S2752 or T2752 or V2752 or I2752 or Q2752, S2753 or D2753 or G2753,
D2754, A2755, L2756 or Q2756, or R2757,

with said notation being composed of a letter representing the amino acid residue by 
its one-letter code, and a number representing the amino acid numbering as shown in 
Table 1, 

or a part of said polypeptide which is unique to at least one of the HCV subtypes or 
types as defined in claims 2 to 3, and which contains at least one amino acid differing
from known HCV types or subtypes amino acid sequences, or an analog thereof being 
substantially homologous and biologically equivalent to said polypeptide. 

24. A polypeptide according to claim 22 comprising in its amino acid sequence at least 
one of the sequences represented by SEQ ID NO 107 to 207 as listed in claim 5, or
part of said polypeptide which is unique to at least one of the HCV subtypes or types
as defined in claims 2 to 3, and which contains at least one amino acid differing from
known HCV types or subtypes amino acid sequences, or an analog thereof being
substantially homologous and biologically equivalent to said polypeptide.

25. A polypeptide having an amino acid sequence as represented in any of SEQ ID NO
1 to 106, or a part thereof which is unique to at least one of the HCV subtypes or
types as defined in claims 2 to 3, and which contains at least one amino acid differing
from known HCV types or subtypes amino acid sequences, or an analog thereof being
substantially homologous and biologically equivalent to said polypeptide.

26. A recombinant polypeptide encoded by a polynucleic acid according to any of
claims 1 to 8, or a part thereof which is unique to at least one of the HCV subtypes
or types as defined in claims 2 or 3, and which contains at least one amino acid
differing from known HCV types or subtypes amino acid sequences, or an analog
thereof being substantially homologous and biologically equivalent to said polypeptide.

27. A method for production of a recombinant polypeptide of claim 26, comprising: 

- transformation of an appropriate cellular host with a recombinant vector, in
which a polynucleic acid or a part thereof according to any of claims 1 to 8 has
been inserted under the control of the appropriate regulatory elements,
- culturing said transformed cellular host under conditions enabling the expression
of said insert, and,
- harvesting said polypeptide.

28. A recombinant expression vector comprising a polynucleic acid or a part thereof
according to any of claims 1 to 8 operably linked to prokaryotic, eukaryotic or viral 
transcription and translation control elements. 

29. A host cell transformed with a recombinant vector according to claim 28. 

30. A method for detecting antibodies to HCV present in a biological sample, 
comprising:

(i) contacting the biological sample to be analysed for the presence of HCV with
a polypeptide according to any of claims 22 to 26,

(ii) detecting the immunological complex formed between said antibodies and
said polypeptide.

31. A method for HCV typing, comprising: 

(i) contacting the biological sample to be analysed for the presence of HCV with 
a polypeptide according to any of claims 22 to 26, 

(ii) detecting the immunological complex formed between said antibodies and 
said polypeptide. 

32. A diagnostic kit for use in detecting the presence of HCV, said kit comprising at 
least one polypeptide according to any of claims 22 to 26, with said polypeptide being 
possibly bound to a solid support. 

33. A diagnostic kit for HCV typing, said kit comprising at least one polypeptide 
according to any of claims 22 to 26, with said polypeptide being possibly bound to a 
solid support. 

34. A diagnostic kit according to claims 32 to 33, said kit comprising a range of 
polypeptides which are attached to specific locations on a solid substrate. 

35. A diagnostic kit according to claims 32 to 34, wherein said solid support is a 
membrane strip and said polypeptides are coupled to the membrane in the form of 
parallel lines. 

36. A pharmaceutical composition comprising at least one polypeptide according to 
any of claims 22 to 26 and a suitable excipient, diluent or carrier. 

37. A method of preventing HCV infection, comprising administering the
pharmaceutical composition of claim 36 to a mammal in an effective amount to stimulate
the production of protective antibody or protective T-cell response.



38. Use of a composition according to claim 36 in a method for preventing
infection as defined in claim 37.

39. A vaccine for immunizing a mammal against HCV infection, comprising at least
one polypeptide according to claims 22 to 26, in a pharmaceutically acceptable carrier.

40. A vaccine according to claim 39, comprising at least one polypeptide according 
to claims 22 to 26, with said polypeptide being unique for at least one of the HCV 
subtypes as defined in claims 2 or 3. 

41. A peptide corresponding to an amino acid sequence encoded by at least one of the
HCV polynucleic acids according to any of claims 1 to 8, with said peptide comprising 
an epitope being unique to at least one of the HCV subtypes or types as defined in 
claims 2 or 3, and with said peptide containing at least one amino acid differing from 
any of the known HCV types or subtypes amino acid sequences, or an analog thereof 
being substantially homologous and biologically equivalent. 

42. A method for detecting antibodies to HCV present in a biological sample, 
comprising: 

(i) contacting the biological sample to be analysed for the presence of HCV with 
a peptide according to claim 41,

(ii) detecting the immune complex formed between said antibodies and said
peptide.

43. A method for HCV typing, comprising: 

(i) contacting the biological sample to be analysed for the presence of HCV with 
a peptide according to claim 41,

(ii) detecting the immune complex formed between said antibodies and said 
peptide. 

44. A diagnostic kit for use in detecting the presence of HCV, said kit comprising at
least one peptide according to claim 41, with said peptide being possibly bound to a 
solid support. 

45. A diagnostic kit for HCV typing, said kit comprising at least one peptide according 
to claim 41, with said peptide being possibly bound to a solid support.

46. A diagnostic kit according to claims 44 or 45, wherein said peptides are selected 
from the following list: 

- at least one NS4 peptide,
- at least one NS4 peptide and at least one Core peptide,
- at least one NS4 peptide and at least one Core peptide and at least one E1
peptide, or,
- at least one NS4 peptide and at least one E1 peptide.

47. A diagnostic kit according to claims 44 to 46, said kit comprising a range of
peptides which are attached to specific locations on a solid substrate. 

48. A diagnostic kit according to claims 44 to 47, wherein said solid support is a
membrane strip and said peptides are coupled to the membrane in the form of parallel 
lines. 

49. A pharmaceutical composition comprising at least one peptide according to claim 
41 and suitable excipient, diluent or carrier. 

50. A method of preventing HCV infection, comprising administering the 
pharmaceutical composition of claim 49 to a mammal in effective amount to stimulate 
the production of protective antibody or protective T-cell response. 

51. Use of a composition according to claim 49 in a method for preventing HCV 
infection as defined in claim 50. 

52. A vaccine for immunizing a mammal against HCV infection, comprising at least 
one peptide according to claim 41, in a pharmaceutically acceptable carrier. 

53. A vaccine according to claim 52, comprising at least one peptide according to 
claim 41, with said peptide being unique for at least one of the subtypes or types as 
defined in claims 2 or 3. 

54. An antibody raised upon immunization with at least one polypeptide or peptide 
according to any of claims 22 to 26 or 41, with said antibody being specifically 
reactive with any of said polypeptides or peptides, and with said antibody being 
preferably a monoclonal antibody. 



55. A method for detecting HCV antigens present in a biological sample, comprising: 
(i) contacting said biological sample with an antibody according to claim 54, 

(ii) detecting the immune complexes formed between said HCV antigens and 
said antibody. 



56. A method for HCV typing, comprising: 

(i) contacting said biological sample with an antibody according to claim 54, 

(ii) detecting the immune complexes formed between said HCV antigens and 
said antibody. 



57. A diagnostic kit for use in detecting the presence of HCV, said kit comprising at 
least one antibody according to claim 54, with said antibody being possibly bound to 
a solid support. 

58. A diagnostic kit for HCV typing, said kit comprising at least one antibody 
according to claim 54, with said antibody being possibly bound to a solid support. 

59. A diagnostic kit according to claims 57 to 58, said kit comprising a range of 
antibodies which are attached to specific locations on a solid substrate. 

60. A pharmaceutical composition comprising at least one antibody according to claim 
54 and a suitable excipient, diluent or carrier. 

61. A method of preventing or treating HCV infection, comprising administering the 
pharmaceutical composition of claim 60 to a mammal in effective amount. 

62. Use of a composition according to claim 60 in a method for preventing or treating 
HCV infection as defined in claim 61. 

Figure 1 - continued 

551 600 

HCV-1 1a GCTTGACTGTGCCCGCTTCGGCCTACCAAGTGCGCAACTCCACGGGGCTT 

HCV-J 1b -T CA-C — A C — T G-G -GTGT-C A-A 

HC-G9 1c — C A— C — T GT-GG TT G-G 

BNL1 1d G — T — AA-KA-C — TC — G-G G-AT-C G-G 

BNL2 1d G — T--AA — A-C — TC-TG-G G-AT-C G-A 

FR2 1f — C-C — A — C A-C--T TG-G A — G-A- A — C-ATGGC 

HC-J6 2a — A-C--CACC — G-TC — C — TGC-G AAG AT — GTACCGGC 

HC-J8 2b --G-C — A A-TG--T — AGTGG CA-G ATT-GTTCTAGC 

S83 2c — A-CT A-T C GTGG-G — CAAGG — A — GGC-ACTCC 

NE92 2d -TA-C G-TC — C-G — TG — G — CAAG A GCA-CTC- 

BNL3 2e -TG-C — C T-TC — T-N-GTTG-G — CAAA — TA GTCA-GCC 

FR4 2f -TA-C — C TG — T ATA — G — TAAG AA — GCCACT-C 

BNL4 2g -TG-C— C T-TC— T GTG — G — TAAG A GTACCA-G 

BNL5 2h -TC-C G — G — C — TGTG — G — CAAG A GCCACTC- 

BNL6 2i — A-C— C G-TC — T GTG TGCG CG — GT — TTC- 

NZL1 3a A-T - CAT — A — AG - C AG T C TAG- GTG G — TA-GT-T — C — C 

HCV-TR 3b TGC G — T- G — TAG- GTACACG A-GT-T— C— A 

NE48 3c GTCTGT— T— AG-A-GGCT-G-GTAC— G— TGTAT-C — C— C 

NE274 3d GTCTGT — T G-A-GGATTG — TAC — G — TGTGT-T — C— C 

NE145 3e CT-TGC — T — AGTC-GG-TGG-G — T G-AT-C— T—C 

NE125 3f GT-TCC AG GGCTAG-GTACA-G A-GT-C — C— A 

Z4 4a — C-C T — A — G TG-G — CTAC — G — TG-TT CA-C 

Z1 4b — C AACA — A— A— T GTG — CTAC— G— TG-TT CG-C 

GB358 4c — C T A-C GT-A-CTAT TG— T CA-C 

DK13 4d — C T A-CTAT AG-T TG-C 

GB809 4e — C-C T G G-GTTA-CTAT TG-TT CG — 

BNL7 4k — C C AT -A-CTAT TGT-T CA — 

BNL8 4k — C T AT TA-C TAC A — T CA-C 

BNL9 4k — C C AT TA-C TAC -A A— T CA-C 

BNL10 4k -TC C ACTA-CTAT GT-T CA-C 

BNL11 4k — C C AC-A-CTAC TGT-T CA— 

BNL12 4l — C C — G — C TC-G—TTAT—G— TGT-T CA — 

BE95 5a -TC C — T — G — C — T — AGTT-CCTAC — A — TG — T-T A — 

HK2 6a — C-C — AAC A TCTTACCTACG GT A 

FR1 7a — C-C ACA — A — C — A — AATT CAAG G — T-T A-C 

VN4 8a — C-T — AACA — A — C — C — GGCG — TTATAC AAGT-T — C — G 

VN12 9a — C-C — CAC T — C — C — ACTAA-CTATGCT AAGT-T G 

NE98 10a CT-ACA A-AG-C-GGCTGG-GTAC — T — TG — T-C — A — C 

Figure 1 - continued 

HCV-1 1a TACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGCGGC 

HCV-J 1b T G — C — C T-C A ■ T A — 

HC-G9 1c T C — TG — TCCG A A 

BNL1 1d — T — T C — C — TT-C C — CA-C — T AT — A 

BNL2 1d — T — TC C — TT-C C — CA-C — T AT-AG 

FR2 1f T T C — TT-C GGC — C — C — A — T AAA 

HC-J6 2a ATG — G C — C A-C — TGAT — C ACC-GGC -ACTCCA 

HC-J8 2b T C T T-A AAC— C — CACC-GGC — CTCA- 

S83 2c ATGCCG C T-C T C — T-GGC — CTT-A 

NE92 2d ATG — A C AG AGT — C — C — C-GGC — CTCAG 

BNL3 2e — TATG-CA C — C T-C AAC — C — C— A-GGC-ATT — N 

FR4 2f ATG-CG — T C — TG-C — TGAC — C — C — C-GGC — CTCAG 

BNL4 2g ATG-CA C — TT-C AAC — C — CA-C-GGC-AAT-CA 

BNL5 2h — TATG — G T-A AGC — C C-GGC — CTTAA 

BNL6 2i ATG— G T-G AGC— C — C — T-GGC — CTC-A 

NZL1 3a GT-C-T C — C — TT-C — TAGC T C-A 

HCV-TR 3b — TGTGC-T C — C T TGG C C-A 

NE48 3c ATAC C — TT-G AGC — C — A T C-A 

NE274 3d GTGC C — C T GGC C T CC- 

NE145 3e ATGC C T-A AGC— C — A — A— T A 

NE125 3f ATAC-T C— C T AGC— C— C T T-A 

Z4 4a — T A T — G — T — C A — C — T — A — T-A 

Z1 4b — T — T A-C — C — A A A 

GB358 4c — T A C G C— A A-C-A 

DK13 4d T C G C — A — C — T — AA-C-A 

GB809 4e — T A C — C G — TG C — A A-C-A 

BNL7 4k T-T G — T — A — C — A T C-A 

BNL8 4k C G C — A— T— T C-A 

BNL9 4k — T — TA C — C G — T — A — C — A T C-A 

BNL10 4k T C G— T — A— C — A T C-A 

BNL11 4k T C G — T — A — C — A TT C-A 

BNL12 4l C — C G C— C— A T T-C-A 

BE95 5a — T — T— T A TTCC — A — C — T A-A 

HK2 6a TC A C C— C— C CTG A 

FR1 7a TC-T C T-G AAC — C — C — T-TT A 

VN4 8a TC C — C C AGC— C — C — T — T A 

VN12 9a — T — TC-A C C — TAGC — C T AA 

NE98 10a ATG — A — T — C — C AG GGT C T — — C-G 

HCV-1 1a CGATGCCATCCTGCACACTCCGGGGTGCGTCCGTTGCGTTCGTGAGGGCA 

HCV-J 1b G — CATG A C— C G— C C— G— A-T- 

HC-G9 1c GA-CCTG A TCTG — C T--G— C-A A~C 

BNL1 1d — G-ATG A TAC— A G— C G AT- 

BNL2 1d T- G-ATG T G-C— A T— G— C G AA 

FR2 1f G — CAT T G— T N — G — C A- A — G — A 

HC-J6 2a G-C— TG C— GTC--C G AGAAA-T G- 

HC-J8 2b T — C — AG-T — C — TCT T--A A — T-AGAA TAATG 

S83 2c A-GA — AG-G — T — T T--A T-AG ACC-C— 

NE92 2d G TG-T — T GTC — C T T-AGGAGA 

BNL3 2e G — C — GG-G — T — TGT T--A--T C AGAA-AGCTC-G 

FR4 2f G — C — GG-G — C — TGT T — A — T C — T-AGA-GTCA — T- 

BNL4 2g G-GC — GG-G — T — TGT T — A — T G — T-AGTTGC 

BNL5 2h G TG-G— T GTC — T — A — T — T — A — T-AGA-GC-CCAA- 

BNL6 2i G — G G T GTC — T — A — T — T — C — T-AGT-GA A — 

NZL1 3a T T T A— C~C~T~A T--C-AG—C 

HCV-TR 3b A TG T TTA — C — A G~C CACAACC 

NE48 3c -C T T TTG — C — T A— C C-AAA-CAAT- 

NE274 3d T — A-T T TTG — A — T — T — G — C AATCA 

NE145 3e A TG TG— T— T T--C G-AGA-C 

NE125 3f TA T TG — C — C — T — G — C AC— C T- 

Z4 4a -C — CA A TTG A— C — T — GATGACT — G- 

Z1 4b GC-CCA A TTG— A T C — T— G — GAC — AG- 

GB358 4c GC-CCA A CTC--A TT-A— C GA-G-TT— G- 

DK13 4d TT-CCA T-A CTC A T GA-G — A — G- 

GB809 4e -A— CA T-A CTC — A A— C— T— GAAGACC— G- 

BNL7 4k -C— CA T CTC— A— T G— C GA-A G- 

BNL8 4k -C-CCA T CT A— T G— C GA-AACT— G- 

BNL9 4k -C— CA T TCTC— A— T G— C GA-A-T G- 

BNL10 4k -C— CA T-AGCACT A— T G— C GA-A-T G- 

BNL11 4k -C— CA T CT A— A G— C GAAA A- 

BNL12 4l -C— CA T-A CTA — A T— A— C— T— GAAGACT— G- 

BE95 5a TA-CCTG A G-A— T— T G T— CATGACA— T- 

HK2 6a T-C-ATG T TTTG — T — A T-G T — GA-G-TC-ATG 

FR1 7a GACCATG — A TCT A— T— T A— TA-CAAG-C G- 

VN4 8a GACACTG — TT TTG— T T— A T— GAAGRT-RA— 

VN12 9a T-GCATG TCTC T C GAAGACC 

NE98 10a G— ATT C TTA— T— C— T C A— CTCT 

Figure 1 - continued 

Z4 4a CCGGGCGCT— GCTTGA-TC-T-C— G--A--TG-G— CT-AA-G—A— 

Z1 4b CC CGCA — GTTAGA-TCCA-G — CA-G — TG-A — C A-G— G— 

GB358 4c AT-GGCGCT — GCTTGAATCC — C — GA TG-G A-G— A-- 

DK13 4d CTG — TGCT — GCTTGA-TCTT-GA G-G A-G — G — 

GB809 4e -T-GGTGCT — GCTCGA — CCT-G — G — C — TG-G C A-G A 

BNL7 4k AT-GGCGCG— ACTTGA-TCT— A — GA TG-G — CT — A-G G — 

BNL8 4k AT-GGCGCA — GCTTGA-TCT — G — GA TG-G A-G— G— 

BNL9 4k AT-GGCGCA— GCTTGA-TCCT-G— GA TG-G— A-G— G— 

BNL10 4k AC-GCGGCG — GCTTGA-TCC — G — GA TG-G A-G— G— 

BNL11 4k AT-GGCGCG— ACTTGA-TCT— A— GA TG-G G— A-G— G— 

BNL12 4l CTTTCGGCT— ACTT-T-TCCG-A — G — G — TG-G A-G— G— 

BE95 5a CT-GG-GCAGT-A— G-T-CT GA-AGC-G-T— CTAC— A-CG— 

HK2 6a -CTTCCACG A GGAT-C CA-G TG-G T CG— 

FR1 7a TCATC-G-G— AATCCACGG-T C — A G-A — C— C— C— T— 

VN4 8a -CGTCTACG— A-TC— CGG-T-C— CAAA— TG-G— CA-CA-G— G— 

VN12 9a -CGTCGG-GT— ATC-G-GGTG-C— CGAG— G-G— C— CT-G— G— 

NE98 10a CC-TGCGC-G— A-CG-CTCT— C— CACG G-G— A— A-G— G— 

Figure 1 - continued 

851 900 

HCV-1 1a TCTTTCTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCGCCACTGGACG 

HCV-J 1b -T C TC G A--TC-C— GT-TGA 

HC-G9 1c C T GA-C A T 

BNL1 1d C--C-CT G— A T — A C-CATG CAT— A 

BNL2 1d C G— A T--A C-CTTGT — CAT— A 

FR2 1f C— C— T— G T A-GT— C G— T 

HC-J6 2a -GA-G CA-C GA TTG G— ACA— A TTT 

HC-J8 2b -GA-GA — C-ATCG — GGCT TGG-A— A— ACAA AACTTC 

S83 2c -GA-G — G-C— CT— GG-CG— GT-G-G — G— ACAA-A TAC-TTT 

NE92 2d -GA-GT-G-CTTCT G-C T-A G CA— AT— TAA-TTT 

BNL3 2e -GA-GA-A-CT-CA— GGCT T-G-GG-A—G-A T-ACTTC 

FR4 2f -GA-GA-A-CA-CG G-TGC-GT-G A — GCAATA TACTTTT 

BNL4 2g -GA-GA-A-CT-CT — GG-TG TTG G — GCAA-AT AACTTT 

BNL5 2h -GA-GT-G TCT T-T TGA C — TCA — A ATCTTC 

NZL1 3a C— G— A GCC G AGA— TC-A TCAA 

HCV-TR 3b -G G— A GC AGA— TC-C AC C 

NE48 3c -T— C— C— A— A GCA A AGA C-A CA A 

NE274 3d CT-G— G— A— GGCT AGA — TC-T-AG AAC 

NE145 3e C G— G— GGCC— T— A AGG— TC-T— T— TAC T 

NE125 3f -T — C G GC T AGAG-TC AA— T-AT— C 

Z4 4a C C — GA-G — G — GA — A T — TCGG — GC-T C 

Z1 4b C — A — G G GA CGA — GC-C — G C 

GB358 4c -A T-G— T— T— GA T-T CAG— GC T 

DK13 4d -G— CT-G T CAA— TC-C C 

GB809 4e -A — CT-G— A A CAA — GC-A 

BNL7 4k -G— C— A T— GA T-T CGA— A T 

BNL8 4k -G — CT-G — T — T — GA TT-T CGA — AC-A T 

BNL9 4k CG — CT-G — T — T — GA T-T CGA — AC C 

BNL10 4k -G— CT-G— T— T— GA T-T— YCAG— TC T 

BNL11 4k -G— C— G— T— T— GA T-T CGA— AC T 

BNL12 4l C C — A — G — G — GA CAG— GC-T T 

BE95 5a -A— CT-G— A A ATAGG— TC-C-AG GCT 

HK2 6a T-G-CG— A A TCAG— C-C— T— T T 

FR1 7a -AA-CT-G— A— G— G— T— T— T AGG— T-A-TA TCA-GTT 

VN4 8a -T— C— C— T— A— G— C GC— AGG— TC— ATG— TCA-GTT 

VN12 9a C — T — G — GT G AGA ATGT-TGA — TC 

NE98 10a -A Y— G— GGG T-A-GGAGA-ATC-C-AG— T T 

Figure 1 - continued 

901 950 

Figure 2 

1 50 
HCV1 1a MSTNPKPQKKNKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATR 

HCV-J 1b R-T 

BNL1 1d R-T XXXXX X 

BNL2 1d R-T X 

CAM1078 1e R-T V A- 

FR2 1f R-T 



HCJ6 2a R-T 

HCJ8 2b R-T 

CH610 2c R-T 

NE92 2d R-T 

BNL3 2e R-T 

FR4 2f R-T P~ 

HCVTR 3b L RQT L N V V- 

DK13 4d R-T M 

CAM600 4e R-T M 

GB809 4e L-R-T M 

BNL7 4k R-T M 

BE95 5a R-T M 

HK2 6a — L R-T T 

FR1 7a L R-T M 

VN4 8a L R-T 1 

VN13 8b L R-T 

VN12 9a L R-T M 

NE98 10a L R-T X V Q V- 

Figure 2 - continued 

51 100 

HCV1 1a KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP 

HCV-J 1b M 

BNL1 1d X-X---S X 

BNL2 1d D QSD-XX H 

CAM1078 1e E 

FR2 1f S A 

HCJ6 2a D — ST-KS-GK L 

HCJ8 2b D — ST-KS-GK 

CH610 2c D — TT-KS-GR L 

NE92 2d D— T-KS-GK L 

BNL3 2e D-XAT — S-GR L 

FR4 2f D — AT-KS-GR L 

HCVTR 3b KQ-HL SR S K L 

DK13 4d QL S 

CAM600 4e T S 

GB809 4e S S 

BE95 5a Q-T--S-G A L 

HK2 6a Q-Q — H 

FR1 7a V-Q-T— S-G 

VN4 8a V-HQT 

VN13 8b V-HQT 

VN12 9a A V-QNQ 

NE98 10a S R T S 

Figure 2 - continued 

101 150 

HCV1 1a RGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARA 

HCV-J 1b 

BNL1 1d N 

BNL2 1d 

FR2 1f N S-T 

HC-J6 2a N H V V V 

HC-J8 2b T H R 1 V V— V— 

CH610 2c H V V— V— 

NE92 2d H V y— V— 

BNL3 2e —XX X-V V— X 

FR4 2f N H X V V— V 

HCV-TR 3b N F V— V 

GB116 4c V— V— 

DK13 4d N V V— V 

CAM600 4e -X— X N X V— V 

GB809 4e N V— V 

G22 4f V--V 

GB549 4g V— V 

GB438 4h V— V 

BNL7 4k N 

BE95 5a N N K G-I— V 

HK2 6a H N V V-A- 

FR1 7a N N XXL VL-G V-A- 

VN4 8a N N V X--V-X- 

VN13 8b X N N X XX IE— 

VN12 9a D-X-N X E V V-AE 

NE98 10a N 

Figure 2 - continued 

LAHGVRVLEDGVNYATGNLPGCSFSIFLLALLSCLTVPASAYQVRNSTGL 

j E VS-I 

XT-HE AS-V 

F TT-HE AS-V 

-X XG— XXXXX — X XX X T E-HST-DG 

F I-T-V— AE-K-ISTG 

j v V — VE ISSS 

j s IS— V— VE-K-TSTS 

j 1 V-GL — K-TSSS 

—X I~X X V V-XVE-K-TSQA 

j _ 1 v — I — K-NSHF 

v v — V — K-TSTM 

AV 1 T — VNY — AS -I 

Figure 2 - continued 

251 300 

1a GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT 

1b SSI-T-TI V A-A M S YE- 

1d ASV-TXAI V XX- F M — X A M-H- 

1d ANV- TAAI V T -AFR — M LYH- 

1f ANA- 1 DEV V A- VF M- 1 G TS 

2a PGALTQG — T MV-M G -M-AA-M- IV- -QH — F 

2b RGALTRS — T - V-MI -MA- - A V — A-MI L S - A-MV- - Q—NF 

2c PGTLTKG - - A- V- VI -M V- -ALMI AA- AVI A- - Q— T F 

2d PGALTKG — T TIIA F 1 A-M- AS -V- II — QH-KF 

2e PGALTKG — AR- -AV-M V- - A-MIAA-A- IVA-K- - YF 

2f PGALTRG— A T I -M 1 A-MIAA-VAW— QY- TF 

2g PGALTRG- - T T I -MV 1 - -V- - A-MI AA- WI V- - QH - NF 

2h PGALTRG— T TI-A V F — A-M — S - F-MI — QH - 1 F 

2i PGAXTKG — T II-A F 

3b LGVTTASI-T-V-M ARQ AF-A A R T- 

4c VGA- LES—S - V — M- -A- - V 1 G M- S-Q 

4d LNA-LES V— M--G 1— V— G Q 

4e AGA-LEP V — M — A — M 1 GL M Q 

4e VGA- LE P V- -M- - A- - V GL M Q 

4f LGA-LESM V — M — T GI — A — M R — L 

4g VGA- LE SM V — M- - A — V 1 G M R 

4h LGA-L-SV-Q-V— M— A 1 — H — G A— MVS-Q 

4k IGA-LES— S-V— M— A— V I--X-XGL M-S-R 

4k I GA- LES — S - V — M- -A — V 1 GL M- S-R 

4k IGA-LES— S-V— M— A— V 1 GA M-S-R 

4k TAA- LES—S -V- -M- - A- - V I-X GL M- SXQ 

4k I GA- LES—S -V-VM- -A — V 1 GL M- S-R 

4l L S A- LMSV V- -M- - A S GA M Q 

4 x VDA- LE S F V — M — A V GA M Q 

5a LGAVTAP AV- Y- A- G - A A- - AL M- - YR- - Q- A- 

6a AST GF V A-A-W— S— I L— A Q 

7a SSV-IHGF V A-AF M-I II R-KY-QV 

8a AST -V-GF-K-V- IM- -A-AF M GL LR- -M- QV 

9a ASVSIRGV-E-V A-AF— M GL R— MYEI 

10a PCAATAS— T-V-MM-XA AL — X — G-SWRH-Q 

Figure 3 

SEQ ID NO. 1 (BNL1, 1d) 

ATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAACACCAACCGCCGCCCTCAKGGSGTN 
NNNNNNCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACCTGTTGCCGCGCAGGGGCCCCAGGNNG 
GGTGTGCGCGCGACTAGGAAGACTTCCGAGCGGTCACAACCTCGTGGCAGGCGACAGCCTATCCCC 
AAGGCTCGYCGGYCCGAGGGCAGGTCCTGGGCTCAGCCCGGGTATCCTTGGCCCCTCTATGGCAAT 
GAGGGCTGCGGGTGGGCGGGNTGGCTCCTGTCCCCCCGCGGCTCTCGGCCCAATTGGGGCCCC 

SEQ ID NO. 3 (BNL1, 1d) 

GACGGCGTGAACTATGCAACAGGGAACTTGCCCGGTTGCTCTTTCTCTATCTTCCTCTTGGCTTTG 
CTGTCCTGCTTGACGGTTCCAACKACCGCTCACGAGGTGCGCAACGCATCCGGGGTGTATCATGTC 

ACCAACGACTGTTCCAACTCGAGCATCATCTATGAGATGGACGGTATGATCATGCACTACCCAGGG 
TGCGTGCCCTGCGTTCGGGAGGATAACCATCTCCGCTGCTGGATGGCGCTCACCCCCACGCTTGCG 
GTCAAAAAYGCTAGTGTCCCCACTRCGGCAATCCGACGTCACGTCGACTTGCTTGTTGGGGGNNCC 
ACGTTCTGTTCCGCTATGTACGTGGGRGACCTTTGCGGGTCTGTCTTCCTCGCTGGCCAGCTATTC 
ACCTTTTCACCCCGCATGCACCATACAACGCAGGAGTGCAACTGCTCAATC 

SEQ ID NO. 5 (BNL2, 1d) 

ATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAACACCAACCGCCGCCCACAGGACGTC 
AAGNTCCCGGGTGGTGGTCAGATCGTTGGTGGAGTTTACCTGTTGCCGCGCAGGGGCCCCAGGTTG 
GGTGTGCGCGCGACCAGGAAGACTTCCGAGCGGTCGCAGCCTCGTGACAGGCGACAGCCTATTCCT 
AAGGCTCGCCAGTCCGATGGCAGNNCCTGGGCTCAGCCAGGGCATCCCTGGCCCCTCTATGGCAAT 
GAGGGCTGCGGATGGGCGGGATGGCTCCTGTCCCCCCGCGGCTCTCGGCCCAGTTGGGGCCCC 

SEQ ID NO. 7 (BNL2, 1d) 

GACGGCGTGAACTATGCAACAGGGAATTTGCCTGGTTGCTCTTTCTCTATCTTCCTCTTAGCTTTT 
CTGTCCTGCTTGACGGTTCCAACTACCGCTCATGAGGTGCGCAACGCATCCGGGGTATATCATCTC 
ACCAATGACTGTTCCAACTCGAGCATCATCTATGAGATGAGTGGTATGATCTTGCACGCCCCAGGG 
TGTGTGCCCTGCGTTCGGGAGAACAACTCTTCTCGTTGCTGGATGCCRCTCACCCCCACGCTTGCG 

GTCAAAGACGCTAATGTCCCTACTGCGGCAATCCGACGCCATGTCGACTTGCTGGTTGGGACAGCC 
GCGTTTCGTTCCGCTATGTACGTGGGGGACCTCTGCGGATCCGTCTTCCTTGTCGGCCAGCTATTC 

ACCTTTTCACCCCGCTTGTACCATACAACACAGGAGTGCAACTGCTCAATC 

SEQ ID NO. 9 (CAM1078, 1e) 

ATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAAAGAAACACCAACCGCCGCCCACAGGACGTC 
AAGTTCCCGGGCGGTGGCCAGATCGTTGGTGGAGTCTACGTGCTACCGCGCAGGGGCCCTAGATTG 
GGTGTGCGCGCAGCGCGGAAGACTTCGGAGCGGTCGCAACCTCGTGGGAGGCGCCAACCTATTCCC 

AAGGAGCGCCGACCCGAGGGCAGGT 

SEQ ID NO. 11 (FR2, 1f) 

ATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGCAACACCAACCGCCGCCCACAGGACGTT 

AAATTCCCGGGTGGGGGGCAGATCGTGGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCCAGGTTG 

GGTGTGCGCGCGACGAGGAAGACTTCCGAGCGGTCGCAACCTCGCGGAAGGC 

GACAGCCTATCCCCAAGGCTCGCCGACCCGAGGGCAGGTCCTGGGCTCAGCCTGGGTACC 

CATGGCCCCTCTATGCTAACGAGGGCTGCGGATGGGCGGGATGGCTCCTGTCCCCTCGCG 

GCTCCCGTCCTAGCTGGGGCCCCAATGACCCCCGACGTAGATCACGCAATTTGGGTAAGG 

TCATCGATACCCTAACGTGTGGCTTCGCCGATCTCATGGGGTACATTCCGCTCGTCGGCGC 

CCCCCTAGGGGGCGCTTCCAGAACCCTGNCACATGGTGTCCGGGTCCTGGNAGGCGGCGTGATNNN 

NNNNNNNNNNAACCTTCCNGGTTGCTCTTTNNCTATCTTCCTCTTGGCNTTACTCTCTTGCCTCAC 

AGTCCCCACCTCTGCCTATGAGGTGCACAGCACAACCGATGGCTACCATGTCACTAATGACTGTTC 

CAACGGCAGCATCGTATATGAGGCAAAGGACATCATCCTTCACACGCCTGGGTGNGTGCCCTGCAT 

ACGGGAAGGCAATATCTCCCGTTGCTGGGTACCGCTCACCCCCACGCTCGCAGCGCGGATCGCGAA 

CGCTCCCATCGATGAGGTGCGGCGTCACGTCGACCTCCTCGTGGGGGCAGCCGTGTTCTGCTCAGC 

CATGTACATTGGGGACCTTTGTGGGGGCGTCTTCCTCGTTGGGCAATTGTTCACCTTCACGTCCCG 

GCGGCATTGGACGGTGCAGGACTGTAATTGTTCCATTTACTCTGGCCACATAACGGGCCACCGNISJN 

NNNN 

SEQ ID NO. 13 (BNL3, 2e) 

ATGAGCACAAATCCTAAACCTCAAAGAAAAACCAAAAGAAATACCAACCGCCGCCCACAGGACGTC 
AAGTTCCCGGGCGGCGGCCAGATCGTTGGCGGAGTTTACTTGTTGCCGCGCAGGGGCCCCAGATTG 
GGTGTGCGCGCGACGAGAAAGACTTCTGAACGGTCCCAGCCACGTGGAAGGCGCCAGCCCATCCCT 
AAAGATCGGNGNGCCACTGGCAGGTCCTGGGGACGTCCAGGATATCCCTGGCCCCTGTATGGGAAC 
GAGGGGCTCGGCTGGGCAGGATGGCTCCTGTCCCCCCGAGGCTCTC 

SEQ ID NO. 15 (BNL3, 2e) 

ACGTGCGGNTNTGCCGACCTCATGGGGTACATNCCCGTTGTCGGCGCCCCGGTGGGCGGGGTNGC 

CAGGGCCCTCGCGNATGGCGTGCGGGTCCTGGAGGACGGGATAAATTATGNAACAGGGAACCTCCC 

TGGTTGCTCCTTTTCTATCTTCTNGTTGGCTCTTCTGTCTTGTGTCACCGTGCCTGTCTCTGNCGT 

TGAGGTCAAAAATACCAGTCAGGCCTATATGGCAACCAACGACTGCTCCAACAACAGCATCGTATG 

GCAATTGGNGGACGCGGTGCTTCATGTTCCTGGATGTGTCCCCTGCGAGAATAGCTCCGGTCGGTT 

CCACTGTTGGATCCCGATCTCGCCCAACATAGCCGTGAGCAAACCTGGTGCTCTCACCAAGGGACT 

GCGGGCACGCATTGATGCCGTCGTGATGTCCGCCACCCTCTGCTCTGCCCTGTACGTGGGAGATGT 

GTGCGGCGCAGTGATGATAGCTGCACAGGCTTTCATCGTGGCACCGAAGCGCCATTACTTCGTCCA 

GGAATGCAATTGCTCCATATACCCAGGCCACATTACAGGTCATCGCATGGCG 

SEQ ID NO. 17 (FR4, 2f) 

ATGAGCACAAATCCTAAACCTCAAAGAAAAACTAAAAGAAACACTAACCGTCGCCCACAGGAC 

GTTAAGTTCCCGGGCGGCGGCCAGATCGTTGGCGGAGTTTACTTGTTGCCGCGCAGGGGCCCCAG 

GTTGGGTGTGCGCGCGC CAAGGAAGACTTCTGAACGGTCCCAGCCACGTGGAAGGCGCCAGCCC 

ATCCCAAAAGATCGGCGCGCCACTGGCAAGTCCTGGGGACGTCCAGGATACCCTTGGCCCCTGT 

ACGGGAACGAGGGCCTCGGCTGGGCAGGGTGGCTCCTGTCCCCCCGGGGCTCTCGCCCCTCGTG 

GGGCCCAAACGACCCCCGGCACAGGTCACGCAACTTGGGTAAGGTCATCGATACCCTCACGTG 

TGGCTTTGSCGACCTCATGGGGTACATACCTGTCGTCGGCGCCCCTGTGGGCGGCGTTGCCAGA 

GCCCTCGCGCATGGCGTGCGGGTCCTGGAGGACGGGATAAATTATGCAACAGGGAACTTGCCCGGT 

TGCTCCTTTTCTATCTTCTTGCTGGCTCTCTTGTCTTGTATCACCGTGCCCGTGTCTGCCATACAG 

GTTAAGAACAACAGCCACTTCTACATGGCGACTAATGACTGTGCCAATGACAGCATCGTCTGGCAG 

CTCAGGGACGCGGTGCTCCATGTTCCTGGATGTGTCCCCTGTGAGAGGTCAGGTAATAGGACCTTC 

TGTTGGACAGCGGTCTCGCCCAACGTGGCTGTGAGCCGACCTGGTGCTCTCACTAGAGGTCTGCGG 

GCTCACATTGATACCATCGTGATGTCCGCCACCCTCTGCTCTGCCCTATACATAGGGGACCTATGC 

GGCGCTGTGATGATAGCAGCGCAAGTTGCCGTCGTCTCACCGCAATACCATACTTTTGTCCAGGAA 

TGCAACTGCTCCATATACCCAGGCCATATCACAGGACATCGAATGGNN 

SEQ ID NO. 19 (BNL4, 2g) 

GACGGGGTAAATTATGCAACAGGGAATCTGCCTGGTTGCTCTTTCTCTATCTTCTTGTTGGCTCTT 
CTGTCTTGTGTCACCGTGCCTGTCTCTGCCGTGCAGGTTAAGAACACCAGTACCATGTACATGGCA 
ACCAATGACTGTTCCAACAACAGCATCATCTGGCAAATGCAGGGCGCGGTGCTTCATGTTCCTGGA 
TGTGTCCCGTGTGAGTTGCAGGGCAATAAGTCCCGGTGCTGGATACCGGTCACTCCCAACGTGGCT 
GTGAACCAGCCCGGCGCCCTCACTAGGGGCTTGCGGACGCACATTGACACCATCGTGATGGTCGCT 
ACGCTCTGTTCTGCACTCTACATCGGGGACGTGTGTGGCGCGGTGATGATAGCTGCTCAGGTTGTC 
ATTGTCTCGCCGCAACATCACAACTTTTCCCAGGATTGCAATTGTTCCATC 

SEQ ID NO. 21 (BNL5, 2h) 

ATGAGCACAAATCCTAAACCTCAAAGAAAAACCAAAAGAAACACTAACCGCCGCCCACAGGACGTT 
AAGTTCCCGGGCGGTGGCCAGATCGTTGGCGGAGTATACTTGTTGCCGCGCAGGGGCCCCCGGTTG 
GGTGTGCGCGCGACGAGGAAAACTTCCGAACGGTCCCAGCCACGTGGGAGGCGCCAGCCCATCCCT 
AAAGATCGGCGCTCCACTGGCAAATCCTGGGGACGTCCAGGATACCCTTGGCCCCTGTATGGGAAC 
GAGGGCCTTGGTTGGGCAGGATGGCTCTTGTCCCCTCGAGGCTCTC 

SEQ ID NO. 23 (BNL5, 2h) 

GACGGGATAAACTACGCAACAGGGAATCTGCCCGGTTGCTCCTTTTCTATCTTCTTGCTGGCCTTG 
CTATCCTGTCTCACTGTGCCGGCGTCCGCTGTGCAGGTCAAGAACACCAGCCACTCTTATATGGTG 
ACCAATGATTGCTCAAACAGCAGCATTGTCTGGCAGCTTAAGGATGCTGTGCTTCACGTCCCTGGA 
TGTGTTCCATGTGAGAGGCACCAAAATCAGTCTCGCTGCTGGATACCTGTGACACCCAATGTGGCC 
GTGAGCCAACCTGGCGCGCTCACCAGGGGTTTGCGGACGCACATTGACACCATCGTTGCGTCTGCT 
ACCGTCTGCTCAGCTTTGTATGTGGGCGACTTCTGCGGCGCAGTGATGTTGGTCTCTCAATTTTTC 
ATGATCTCCCCTCAGCACCACATCTTCGTCCAGGATTGCAACTGCTCGATA 

SEQ ID NO. 25 (BNL6, 2i) 

GACGGGATAAACTATGCAACAGGGAACCTGCCTGGTTGCTCCTTTTCTATCTTCTTACTGGCCCTG 
CTTTCTTGCATCACCGTGCCGGTCTCTGCCGTGCAAGTTGCGAACCGCAGTGGTTCTTACATGGTG 
ACCAATGATTGCTCGAACAGCAGCATCGTTTGGCAGCTCGAGGAGGCCGTCCTTCACGTCCCTGGA 
TGTGTTCCCTGTGAGTGGAAGGACAACACCTCCCGCTGCTGGATACCGGTCACCCCTAACATCGCT 
GTGAGCCAACCTGGCGCGCTTACCAAGGGCCTGCGGACACATATTGACATCATTGTCGCGTCCGCC 
ACGTTCTGCTCTGCCTTGTATGTGGG 

SEQ ID NO. 27 (BNL7, 4k) 

ATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAACACCAACCGCCGCCCCATGGACGTT 
AAGTTCCCGGGTGGTGGCCAGATCGTTGGCGGAGTTTACTTGTTGCCGCGCAGGGGCCCCAGGTTG 
GGTGTGCGCGCGACTCGGAAGACTTCGGAGCGGTCGCAACCTCGTGGGAGACGCCAACCTATCCCC 
AAGGCGCGTCGATCCGAGGGAAGGTCCTGGGCACAGCCAGGATATCCATGGCCTCTTTACGGTAAT 
GAGGGTTGCGGGTGGGCANNATGGCTCTTGTCCCCCCGCGGTTCTC 

SEQ ID NO. 29 (BNL7, 4k) 

GACGGGATCAATTTTGCAACAGGGAACCTCCCCGGTTGCTCCTTTTCTATCTTCCTCTTGGCACTC 
CTCTCGTGCCTGACTGTCCCCGCTTCGGCCATCAACTATCGCAATGTCTCGGGCATTTACTATGTC 
ACCAATGATTGCCCGAATTCAAGCATAGTGTATGAGGCCGACCATCACATCTTGCACCTCCCAGGT 
TGCGTGCCCTGCGTGAGAGAGGGGAATCAGTCACGTTGCTGGGTAGCCCTTACCCCTACCGTCGCA 
GCGCCATACATCGGCGCGCCACTTGAGTCTCTACGGAGTCATGTGGACTTGATGGTGGGGGCCGCC 
ACTGTTTGTTCAGCCCTTTACATCGGGGATTTRTGTGGYGGCTTGTTCCTAGTCGGTCAGATGTTC 
TCTTTCCGACCAAGGCGCCACTGGACTACTCAAGATTGCAATTGTTCCATC 

SEQ ID NO. 31 (BNL8, 4k) 

GACGGGATCAATTATGCAACAGGGAACCTTCCCGGTTGCTCTTTTTCTATCTTCCTCTTGGCACTC 
CTCTCGTGCCTGACTGTTCCCGCTTCGGCCATTAACTACCGCAACACCTCGGGCATCTACCACGTC 
ACCAATGACTGCCCGAACTCGAGCATAGTTTATGAGGCCGACCACCACATCTTGCACCTTCCAGGT 
TGCGTGCCCTGCGTGAGAACTGGGAATCAGTCACGTTGCTGGGTGGCCCTTACTCCTACCGTCGCA 
GCGCCATACATCGGCGCACCGCTTGAGTCTCTGCGGAGTCATGTGGATCTGATGGTGGGGGCTGCC 
ACTGTTTGCTCAGCCCTTTACATCGGGGATTTGTGTGGCGGCTTGTTCTTGGTTGGTCAGATGTTT 
TCTTTCCGACCACGACGCCACTGGACTGCCCAGGATTGCAATTGTTCTATC 

SEQ ID NO. 33 (BNL9, 4k) 

GACGGGATTAATTATGCAACAGGGAATCTTCCCGGTTGCTCCTTTTCTATCTTCCTCTTGGCACTT 
CTCTCGTGCCTGACTGTCCCCGCTTCGGCCATTAACTACCACAACACCTCGGGCATCTATCATATC 
ACCAACGACTGCCCGAATTCAAGCATAGTGTATGAGGCCGACCATCACATCTTGCATCTCCCAGGT 
TGCGTGCCCTGCGTGAGAGTGGGGAATCAGTCGAGTTGCTGGGTGGCCCTTACCCCTACCATCGCA 
GCGCCATACATCGGCGCACCGCTTGAGTCCTTGCGGAGTCATGTGGATCTGATGGTGGGGGCGGCC 
ACTGTCTGTTCAGCCCTTTACATCGGGGATTTGTGTGGCGGTGCGTTCTTGGTTGGTCAGATGTTC 
TCTTTCCGACCACGGCGCCACTGGACCACCCAAGATTGCAACTGCTCCATC 

SEQ ID NO. 35 (BNL10, 4k) 

GACGGGATCAATTATGCAACAGGGAATATTCCCGGTTGCTCYTTTTCTATCTTCCTTYTGGCACTT 
CTCTCGTGTCTGACTGTCCCCGCTTCGGCCACTAACTATCGCAACGTCTCGGGCATCTACCATGTC 
ACCAATGACTGCCCGAATTCAAGCATAGTGTATGAGGCCGACCATCACATCTTAGCACTTCCAGGT 
TGCGTGCCCTGCGTGAGAGTGGGGAACCAGTCACGCTGCTGGGTGGCCCTTACCCCTACCGTCGCA 
GCGCCATACACCGCGGCGCCGCTTGAGTCCCTGCGGAGTCATGTGGATCTGATGGTGGGAGCTGCC 
ACTGTTTGTTCAGCCCTTTACATCGGGGAYTTGTGTGGCGGCTTGTTCTTGGTTGGTCAGATGTTC 
TCTTTYCAGCCTCGGCGCCACTGGACTACCCAGGATTGCAATTGTTCCATC 

SEQ ID NO. 37 (BNL11, 4k) 

GACGGGATTAATTATGCAACAGGGAAYCTCCCCGGTTGCTCTTTTTCTATCTTCCTCTTGGCACTT 
CTCTCGTGCCTGACTGTCCCCGCTTCGGCCACCAACTACCGCAATGTCTCGGGCATTTACCATGTC 
ACCAATGACTGCCCGAATTCAAGCATAGTGTTTGAGGCCGACCATCACATCTTGCACCTTCCAGGA 
TGCGTGCCCTGCGTGAAAGAGGGAAATCATTCACGCTGCTGGGTGGCCCTTACCCCTACCGTCGCA 
GCGCCATACATCGGCGCGCCACTTGAGTCTCTACGGAGTCATGTGGATGTGATGGTGGGGGCTGCC 
ACTGTTTGTTCAGCCCTTTACATCGGGGATCTGTGCGGTGGCTTGTTCCTGGTTGGTCAGATGTTC 
TCTTTCCGACCACGGCGCCACTGGACTACCCAGGAATGCAATTGTTCCATC 

SEQ ID NO. 39 (BNL12, 4l) 

GACGGGATCAATTATGCAACAGGGAACCTCCCCGGTTGCTCTTTCTCTATCTTCATCCTGGCACTT 
CTCTCGTGCCTGACTGTCCCGGCCTCGGCTCAGCATTATCGGAATGTCTCGGGCATTTACCACGTC 
ACCAACGACTGCCCGAACTCCAGCATAGTGTATGAGTCCGACCATCACATCTTACACCTACCAGGG 
TGTGTACCCTGTGTGAAGACTGGGAACACTTCGCGCTGCTGGGTGGCCTTAACACCTACCGTGGCC 
GCGCCCATACTTTCGGCTCCACTTATGTCCGTACGGCGGCATGTGGATCTGATGGTGGGTGCAGCT 
ACCCTATCGTCTGCCCTCTACGTTGGAGACCTCTGCGGGGGTGCCTTCCTAGTGGGGCAGATGTTC 
ACCTTCCAGCCGCGTCGCCACTGGACTGTCCAAGACTGCAACTGTTCCATC 

SEQ ID NO. 45 (VN13, 7a) 

ATGAGCACACTTCCTAAACCTCAAAGAAAAACCAAACGAAACACCAACCGTCGCCCACAGGACGTC 
AAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGGGGCCCTCGTTTG 
GGTGTGCGCGCGACGAGGAAAACTTCTGAACGGTCCCAGCCCAGGGGTAGACGCCAACCTATACCG 
AAGGTGCGTCACCAAACGGGCCGTACCTGGGCTCAACCCGGGTACCCCTGGCCTCTTTATGGGAAT 
GAGGGTTGTGGCTGGGCAGGGTGGCTCCTGTCCCCCCNCGGCTCTCGCCCTAATTGGGGCCCTAAT 
GACCCCCGGNGGAGGTCCCGCAACCTGGGTAAGGTCATCGATACCCTTACTTGNGGSTTCGCCGAC 
CTCATAGAGTACATTCC 

mLgcacacttccaaa^ 

GGACGTCAAGTTCCCGGGTGGCGGCCAGATCGTTGGTGGAGTCTACTTGCTGCCGCGCAG 

GGGCCCGCGCTTGGGTGTGCGCGCGACGAGAAAGACTTCTGAACGGTCCCAGCCCAGAGG 

TAGGCGCCAACCAATACCCAAAGTGCGCCACCAAACGGGCCGTACCTGGGCCCAGCCCGG 

GTACCCCTGGCCTCTTTATGGAAATGAGGGCTGTGGTTGGGCAGGCTGGCTCCTGTCCCC 

CCGCGGCTCTCGCCCAAATTGGGGCCCAAACGACCCCCGGCGGAGGTCCCGCAACTTGGG 

TAAAGTCATCGACACCCTTACTTGCGGCTTCGCCGACCTCATGGGGTATATCCCTGTCGTAG 

GCGCTCCGWT^^GA^^CGTCGCGGNGGCCTTGGCGCATGGGGTCANGGNCATCGAGGACGGNGTAA 

attacgcaacagnSaatcttcccggnngct 

TTACAACACCAGCCTCCGCGGCGCATTATACCAACAAGTCTGGCCTGTACCATCTCACCAACGACT 

GCCCCAACAGCAGCATCGTTTATGAGGCGGAGACACTGATTTTGCACTTGCCTGGGTGTGTAC 

GTGTGAAGRTGRACAATCAATCCCGGTGCTGGGTGCAGGCCTCCCCGACCCTGGCAGTGCCGAACG 

?g??ta^gccagtca^cgg1ttccgcaaacatgtggaca^ 

CAGCTATGTATGTGGGGGACCTGTGCGGGGGCCTTTTCCTCGTTGGACAGCTCTTCACGCT^ 
CTCGGATGCATCAGGTTGTCCAGGAGTGTAACTGTTCCATCTACACAGGGCATATCACTGG 

GAATGGCA 

ATGAGCACACTTCCAAAACCCCAAAGAAAAACCAAAAGAAACACAAACCGTCGCCCAATGGATGTC 
A^GT^TCCCGGGCGGCGGTCAGATCGTTGGTGGAGTCTACTTGTTACCGCG 

gStgtgcgcgcgacgaggaagacttcggaacggtcccaggccagaggtaggcgccaac^ 

AAGGTGCGCCAGAACCAAGGCCGAACCTGGGCTCAGCCTGGGTACCCCTGGCCCCTTTATGGGAAC 

^ggg?tgcgg?^ggc^gggt^ 

S^C C C C C C T~^ w 
TGGAGGCG^ 

AGGGA^TCTTCCTGGTTGCTCTTTCTCTAT 

TGCCTCCGCACTAAACTATGCTAACAAGTCTGGGCTGTATCATCTAACCAATGACTGCCCCAATAG 

cagcmtg^a^ggcgaItggcatgat 

rrGC^^CCTGACCAAGTGTTGGCTGTCGGCCTCCCCGACATTGGCGGTGCAGAATGCGTCGGTGTC 

CATCA^GGGTGTCCGCGAGCACGTG 
CGTGGGCGACTTATGCGGTG 

ATGAGCACACTTCCAAAACCCCAAAGAAAAACCAAAAGAAATACTAACCGTCGCCCTATGGAC 
GTCAAGTTCCCGGGCGGCGGCCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGGGGC 
CCTCGTTTGGGTGTGCGCGCGACGAGAAAGACCTCCGAACGGTCCCAGCCTAGAGGCAGG 
CGCCAGCCCATACCAAAGGTACGCCAGCCGACAGGCCGTAGCTGGGGTCAACCCGGCTAC 

c?SSgcccct¥tatggcaa^ 

GGGTCTCGTCCTAATTGGGGCCCCAACGACCCCCGGCGAAGGTCCCGCAACTTGGGTAAG 
PTCATCG^TACCCTTACATNCGGNCTAGCCGACCT 

CGCA^CAGGGA^TCTTCCTGG^ 

taca^cagcctcI^ 

GAAC^AA^AGC AT C GT TT T T GAGGC G GAGACCAT GATAC T GCAT C T T CCAGGT TGT GT C C CAT GTAT 

CAArrCGGGGAATGAGTC^ 

A^TGCCA^TCC^CGGGTTTCGCC 

C^TGTAC^TCGGAGACCTCTGTGGTAGCATAATCTTGG 

gtacca5c!ggttacc^ 

GGCA 

SEQ ID NO. 49 (NE98, 10a) 

AT GAGCACAC T TC C TAAAC CACAAAGAAAAACCAAAAGAAACACCAACC ? CCGGCCACAGGACGT T 
AAGTTCCCAGGCGGCGGTCAGATCGTTGGTGGAGTTTACGTGCTACCACGCAGGGGCCCCCAGTTG 
GGTGTGCGTGCAGTGCGCAAGACTTCCGAGCGGTCGCAACCTCGCAGTAGGCGCCAACCCATCCCC 
AGGGCGCGCCGAACCGAGGGCAGGTCCTGGGCTCAGCCCGGGTACCCTTGGCCCCTATATGGGAAT 
GAGGGCTGCGGGTGGGCAGGGTGGCTCCTGTCCCCGCGCGGCTCTC 

GACGGAATTAATTT ( C N GCAA^ 

TTCTCATGCTTGCTTACACCCACAGCCGGGCTGGAGTACCGTAATGCCTCCGGACTCTACATGGTA 
ACTAACGACTGCAGTAACGGTAGTATCGTGTATGAGGCCGGGGATATTATCCTCCACTTACCTGGC 
TGTGTCCCCTGCGTACGCTCTGGCAATACATCAAGATGCTGGATCCCTGTGAGCCCYACCGTCGCC 
GTGAAGTCGCCCTGCGCCGCCACCGCCTCTCTCCGCACGCACGTGGATATGATGGTGGGRGCGGCC 
ACCCTATGCTCAGCTCTCTACGTAGGAGACCTTTGTGGAGCGCTATTTCTTGTYGGGCAGGGGTTC 

TCATGGAGACATCGCCAGCATTGGACTGTCCAGGACTGCAACTGTTCCATC 

ctcgacagtt/ctgaIStgacatccg 

CGAGGCTCGCAAGGCCATAAAGTCGCTCACCGAGCGGCTGTACATCGGGGGCCCYCTAACCAATTC 

aaaaggacagaactgcggctaccgtcggtgccgcgccagcggcgtgctgactaccagctgcggcaa 

CACCCTGACATGCTACTTGAAAGCCAGAGCGGCCTGTCGAGCTGCAAAGCTCCGGGACTGCACCAT 
GCTCGTGTGCGGGGATGACCTTGTCGTTATCTGTGAGAGTGCGGGAGTCGAGGAAGACGCGGCGAA 

CCTACGAGCT 
CTCGACAGTTACTGAG^^ 

YGAGGCCCGCAAGGCCATAAAGTCGCTCACCGAGCGGCTGTACGTCGGGGGCCCCCTAACCAATTC 
AAAGGGGCAGAACTGCGGCTATCGTCGGTGTCGCGCTAGCGGCGTGCTGACCACCAGCTGCGGCAA 

CACCCTCACATGCTACTTGAAAGCCAGGGCGGCCTGTCGAGCTGCAAAGCTCCAGGACTGCACGAT 
GCTCGTGTGCGGAGACGACCTTGTCGTTATCTGTGAGAGCGCGGGAGTCGAGGAGGACGCGGCGAA 

CCTACGAGTC 
CTCGACAGTTACTGAG^ 

CGAGGCCCGCAAGGCCATAAAGTCGCTCACCGAGCGGCTGTATATCGGGGGTCCCCTAACCAACTC 
AAAAGGGCAGAACTGCGGCTACCGTCGGTGCCGCGCCAGCGGCGTGCTGACTACCAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAAGCCAGGGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACAAT 
GCTCGTGTGCGGAGACGACCTTGTCGTTATCTGTGAGAGTGCRGGAGTCGAGGAGGATGCGGCGAA 
CCTACGAGTC 

SEQ ID NO. 59 (CAM1078, 1e) 

CGTACAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAG 
TACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCAATGCCTGGA 
GATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTG 
TGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCACCAT 

GAGCACGAATCCTAAACCTCAAAGAAAAACCAAAAGAAACACCAACCGCCGCCCACAGGA 
CGTCAAGTTCCCGGGCGGTGGCCAGATCGTTGGTGGAGTCTACGTGCTACCGCGCAGGGG 
CCCTAGATTGGGTGTGCGCGCAGCGCGGAAGACTTCGGAGCGGTCGCAACCTCGTGGGAG 
GCGCCAACCTATTCCCAAGGAGCGCCGACCCGAGGGCAGGTCCTGGGCGCAGCCCGGGTA 
CCCCTGGCCCCTCTATGGTAACGAGGGCTGCGGGTGGGCAGGTNGGCTCCTGTCCCCTCG 
CGGCTCCCGTCCTAGTTGGGGTCCTACTGACCCCCGGCGTAGGTCACGCAATTTGGGTAA 
GGTCATCGATACCCTCACGTGTTGNTTCGCCGACCTCATGGGGTACATACCG 

SEQ ID NO. 61 (CAM1078, 1e) 

CTCAACGGTCACTGAAGCTGATATCCGAACAGAGGAGTCCATATACCAATGCTGTGACCTGCACCC 
CGAAGCACGTGTAGCCATCAAGTCTTTGACTGAAAGGCTGTACGTCGGGGGGCCCTTGACCAATTC 
AAAAGGGGAGAACTGCGGCTATCGCAGATGCCGTGCCAGCGGCGTCTTGACAACCAGCTGCGGCAA 
CACCCTCACCTGCTATATCAAGGCCCTAGCAGCCTGTAGAGCTGCCAAGCTCCAGGACTGCACCAT 
GCTCGTCTGTGGCGACGACCTGGTCGTGATCTGCGAGAGTGTAGGGACCCAGGAGGATGCGGCGAG 
CCTGCGAGCC 



SEQ ID NO. 63 (FR2, 1f) 

NTCAACAGTCACTGAGAGTGATATCCGTACAGAGGAGTCCATCTACCAATGCTGTGATCTAGACCC 

CGAGGCTCGCAAGGCCATAAGGTCCCTCACAGAGAGGCTTTATATCGGGGGTCCCCTGACAAACTC 

AAAAGGGCAGAACTGCGGCTACCGCCGATGCCGTGCAAGCGGCGTCCTGACGACTAGCTGCGGCAA 

CACCCTCACCTGTTACATAAAGGCCAGGGCAGCCTGTCGAGCTGCGAAGCTCCAGGATTGCTCAAT 

GCTCGTCTGTGGCGACGACCTTGTCGTTATCTGCGAGATCGAGGGGNTCCANGAGGATCCGTCGAN 

NNNNNNNNNN 



SEQ ID NO. 65 (FR16, 1g) 

CGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAACATC 
AACCGCCGCCCACAGGACGTCAAGTTCCCGGGCGGTGGCCAGATCGTCGGTGGAGTTTAC 
CTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACTAGGAAGACTTCCGAGCGG 
TCGCAACCTCGTGGGAGGCGACAGCCTATCCCCAAGGCTCGCCGATCCGAGGGCAGGTCC 
TGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCATGGGTTGGGCAGGG 
TGGCTCCTGTCCCCCCATGGCTCCCGGCCTAGTTGGGGCCCTTCAGACCCCCGGCGTAGG 
TCGCGTAATTTGGGTAAGGTCATCGATACCCTCACATGCGGCTTCGCCGACCTCATGGGG 
TACATTCCGCTCGTCGGCGCCCCCCTAGGGGGCGTTGCCAGGGCCCTGGCGCAAGGCTTC 
CGGGATCTACCACGTCACCAACGATTGTTCCAATGGGAGCATTGTGTATGAGGCGGAAGG 
CATGATCATGCATCTCCCCGGGTGCGTGCCCTGCGTTCGGGAAGGTAATATCTCTCGTTG 
CTGGGTACCGTTTTCCCCCACGCTCGCAGCCAGGAATGCTAGCGTCCCCACTCAGGCAAT 
TCGGCGACACGTCGACTTGCTTGTTGGGGCGGCCACACTCTGTTCTGCTATGTATGTGGG 
GGACCTCTGTGGGTCCGTCTTCCTCGTCGGCCAACTGTTCACCTTCACAWCCCGCCAGNA 
CTACACAGTGCAAGACTGCAATTGTTCCATCTACCCCGGCCATATAACGGG

SEQ ID NO. 67 (FR16, 1g)

NNNNNNNGTCACTGAGAGTGATATCCGTGTCGAGGARTCAATTTACCAATGCTGTGACCTGGCCCC
CGAGGCTCGCGTAGCCATAAAGTCGCTCACTGAGCGGCTATATGTCGGGGGCCCTCTCACCAACTC 
AAAAGGACAGAACTGCGGCTATCGCCGGTGCCGTGCGAGCGGTGTGCTGACTACTAGCTGCGGTAA 
CACCCTCACATGCTACCTGAAAGCCGCCGCGGCCTGTCGAGCTGCAAAGCTCCGGGAATGCACAAT 
GCTCGTGTGTGGCGACGACCTCGTCGTTATCTGTGAGAGTGCGGGGGTCCAGGAGGATGCTGCAAG 
CCTNNNNNNN 

SEQ ID NO. 69 (BNL3,2e) 

CTCGACAGTCACAGAGAGAGATATAAGNACTGAGGAGTCCATATACCAGGCTTGTTCCTTACCCGA
GCAGGCCAGAACTGCCATACACTCATTGACTGAGAGACTCTACGTAGGAGGGCCCATGATGAACAG 
CAAAGGGCAATCCTGCGGATACAGGCATTGCCGCGCCAGCGGAGTGCTCACCACCAGTATGGGGAA 
TACCATCACGTGCTACATCAAGGCCCTAGCGGCTTGTAAAGCAGCAGGAATAGTGGCCCCCACCAT 
GCTGGTGTGCGGCGATGACCTAGTTGTCATCTCAGAGAGTCAGGGAGTCGAGGAGGACGACCGGAA 
CCTGANNNNN 

Figure 3 - continued

SEQ ID NO. 71 (FR4, 2f) 

CTCAACCGTCACAGAGAGGGATATAAGAACTGAGGAGTCCATATACCTGGCCTGCTCCTTACCCGA
GCAGGCCCGGACTGCCATACATTCATTAACTGAGAGACTTTACGTGGGAGGGCCCATGATGAACAG
CAAAGGGCAGTCCTGCGGATACAGGCGTTGCCGCGCTAGCGGAGTGCTCACCACCAGTATGGGGAA 
CACCATCACGTGTTATGTGAAAGCCCTCGCAGCTTGTAAAGCTGCGGGCATTGTTGCCCCCACGAT 
GCTGGTGTGCGGCGATGACCTGGTTGTCATCTCAGAGAGTCAGGGGGCTGAGGAGGACGAGCGAAA 
CCTGAGAGTC 

SEQ ID NO. 73 (BNL5,2h) 

CTCAACAGTCGCGGAGAGAGACATCAGGACCGAGGAGTCCATTTACCTTGCCTGCTCCTTACCCGA
GCAAGCCCGAACTGCCATACATTCATTGACTGAGAGACTTTACGTAGGAGGGCCCATGATGAACAG
CAAGGGACAGTCCTGCGGTTACAGACGTTGCCGCGCCAGCGGAGTGCTCACCACCAGCATGGGGAA 
TACCATCACATGCTATGTGAAGGCATTAGCTGCCTGCAAAGCTGCAGGCATCGTTGCTCCCACGAT 
GCTGGTTTGTGGCGACGATCTGGTCATCATCTCAGAGAGTCAGGGAACCGAGGAGGATGAGCGGAA 
CCTGAGAGTC 

SEQ ID NO. 75 (FR13,2k) 

CGNACANCCTCCAGGCCCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAG 
TACACCGGAATTGCCGGGAAGACTGGGTCCTTTCTTGGATAAACCCACTCTATGCCCGGC 
CATTTGGGCGTGCCCCCGCAAGACTGCTARCCGAGTAGCGTTGGGTTGCGAAAGGCCTTG 
TGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCATCAT 
GAGCACAAATCCTAAACCTCAAAGAAAAACCAAAAGAAACACTAACCGCCGCCCACAGGA
CGTTAAGTTCCCGGGCGGTGGCCAGATCGTTGGCGGAGTATACTTGTTGCCNTGCAGGGG 
NCCCAGGTNGNGTNTATGCGCAACGANGAAGACTNCCGAACAGTCCCAGCCACGTGGGAG 
GCGCCAGCCCATCCCGAAAGATCGGNGCACCACTGGCAAGTCCTGGGGACGTCCAGGATA 
TCCCTGGCCCCTGTATGGGAACGAGGGCCTCGGGTGGGCAGGGTGGCTCCTGTCCCCCCG 
GGGCTCCCGCCCGTCATGGGGCCCGACGGACCCCCGGCATAGGTCGCGCAACTTGGGTAA 
GGTCATCGATACCCTCACGTNCGGCTTTNCCGACCTCATGGGGTACATTCCCGTCGTTGG 
CGCCCCAGTAGGNGGCGTCGCCAGAGCTCTCGCGCATGGCGTGAGAGTCCTGGAGGACGG 
GATAAACTATGAAACAGGGAACCTCCCCGGTTGCTCTTTCTCTATCTCCCTCCTTGCTCT 
TCTGTCCTGAATTACCGNGCCAGTTTCTGCTGTGGAAATCAAAAACACCAGMAACACATA 
CATGGTGACTAACGACTGTTCAAACAGYAGCATCACCTGGCAGCTTNNGNNCGCGGTGCT 
TCACGTTCCTGGATGCGTCCCCTGTGAACGAGAGGGCAACAGTTCCCGGTGCTGGATTCC 
AGTCACGCCCRACGTAKNCGTGAGCCGACCTGGTGCCCTAACCGAGGGTTTGCGATCGCA 
CATCGACACCATCGTAGCGTCCGCAACATTTTGTTCTGCCCTCTACATAGGGGATGTATG
TGGCGCGATAATGATAGCTGCCCAAGTGGTCATCGTCTCGCCGGAGCATCATCACTTTGT 
CCAGGACTGTAACTGTTCCATCTACCCGGGCCACATAACGGGGCCTCGTATGTNG 



SEQ ID NO. 77 (FR13, 2k)

ATCCACAGTCACTGAAAGAGACATCAGAGTTGAAGAGTCCGTTTATCTGTCCTGTTCACTTCCCGA 
GGAGGCCCGAGCTGCCATACACTCACTAACTGAGAGGCTGTACGTGGGAGGTCCCATGCAGAACAG
CAAGGGGCAATCCTGCGGATACAGGCGCTGCCGCGCCAGCGGGGTGCTCACCACTAGCATGGGGAA 
TACTCTCACATGCTACTTGAAGGCCCAGGCGGCCTGCAGGGCCGCGGGCATTGTTGCACCCACAAT 
GCTGGTGTGTGGCGACGACCTGGTCGTCATCTCAGAGAGTCAGGGGACTGAGAGGGACGAGAACAA
CCTGAGACCT 

SEQ ID NO. 79 (FR18, 2l)

CTCAACAGTCACGGAGAGGGACATCAGGAATGAGGAGTCCATATTCCTGGCCTGCTCGTTGCCCGA 
GGAGGCCCGGACTGTCATACATTCGCTCACTGAGAGACTCTACATAGGCGGGCCGATGATGAACAG 
CAAAGGCCAGTCCTGTGGATACAGGCGTTGTCGCGCCAGCGGGGTGTTCACCACTAGCATGGGCAA 
TACCATCACGTGCTATGTGAAAGCCATGGCAGCTTGCAGAGCTGCCGGGATTGACGCCCCCACAAT 
GTTGGTATGTGGCGACGACCTGGTGGTCATCTCAGAGAGTCAGGGGACCGAGGAGGACGAGCGAAA 

TCTGAGAGTC 

SEQ ID NO. 81 (PAK64,3g) 

CTCTTGACTCTACTGTCACTGAACAGGATATCAGGGTAGAAGAAGAAATATACCAATGTTGTGACC
TTGAGCCGGAGGCTAGACGGGCAATCAAATCGCTCACGGAACGGCTTTACGTTGGAGGTCCCATGT 
TCAACAGCAAGGGGCTCAAATGCGGATATCGCCGTTGCCGTGCTAGCGGTGTATTGCCCACTAGCT 
ACGGTAATACAATCACCTGCTACATCAAGGCCAGAGCGGCTGCTCGAGCTGCGGGCCTTCAAGACC 
CATCATTCCTTGTCTGCGGAGATGATTTGGTGGTAGTGGCTGAGAGTTGCGKCGTTGATGAGGAGG 

ATAGGGCAGC 

SEQ ID NO. 83 (BNL8, 4k) 

CTCCACTGTAACCGAAAAGGACATCAGGCCCGAGGAAGAGGTCTATCAGTGTTGTGACCTGGAGCC 
CGAAGCTCGCAAGGTTATTACCGCCCTCACAGAAAGACTCTACGTGGGCGGCCCCATGCACAACAG 
CAAGGGAGACCTTTGTGGGTATCGGAGATGCCGCGCAAGCGGCGTCTACACGACCAGCTTCGGAAA 
CACACTGACGTGCTACCTCAAAGCCTCAGCTGCTATTAGAGCGGCAGGGCTGAGAGACTGCACCAT
GCTGGTTTGCGGTGACGACTTGGTCGTCATCGCTGAGAGCGATGGCGTAGAGGAGGATAACCGAGC
CCTCCNAGCC 



SEQ ID NO. 85 (BNL12, 4l)

CTCCACGGTGACTGAAAAGGACATCAGGGTCGAGGAAGAGATCTATCAATGTTGTGACCTGGARCC 
CGAAGCCCGCAAAGCAATATCCGCCCTCACAGAGAGRCTCTACTTGGGCGGCCCCATGTATAACAG 
CAAAGGGGAGCTCTGCGGGTATCGGAGGTGCCGCGCGAGCGGAGTGTACACCACAAGTTTCGGGAA 
CACAGTGACCTGCTATCTTAAGGCCACCGCAGCTACCAGGGCTGCAGGCCTAAAAGACTGCACCAT 
GCTGGTCTGCGGTGACGACTTGGTCGTCATCGCCGAGAGCGAGGGCGTAGAGGAGGATTCCCAACC 
CCTCCGAGCC 



SEQ ID NO. 87 (EG81,4m) 

CTCCACCGTAACCGAAAGGGACATCAGGGTCGAGGAGGAGGTCTATCAGTGTTGTGATCTGGAGCC 
AGAGGCCCGCAAGGCAATATCCGCCCTCACGGAGAGACTCTATGTGGGCGGTCCCATGTTTAACAG 
CAAGGGAGACCTATGTGGCTACCGCAGGTGCCGCGCAAGCGGCGTCTACACCACCAGCTTCGGAAA 
CACACTGACCTGCTACCTCAAGGCCACGGCCGCTACCAGAGCGGCCGGCCTGAAGGATTGCACAAT 
GCTGGTTTGCGGGGACGACCTGGTCGTCATCGCAGAGAGCGATGGCGTGGACGAGGACCGCCGAGC 

CCTCCAAGCT 

SEQ ID NO. 89 (VN13,7a) 

CTCAACAGTCACAGAGCGCGATGTCCAGACGGAGCATGACATCTACCAGTGCTGTAAGTTGGAGCC 
CGCAGCACGGACAGCCATCACATCGCTTACTGACCGATTGTACTNCGGTGGTCCCATGTNTAACTC 
TAAAGGTCAGGCATGTGGATACCGTAGGTGCAGGGCCAGTGGCGTCTTGACCACCATCCTGGCCAA 
TACTCTGACTTGCTACTTGAAAGCTCAGGCGGCATGCAGAGCTGCCGGGCTGAAGGACTTTGACAT 
GTTGGTCTGCGGAGACGACCTTGTCGTTATTTCGGAGAGTTTGGGGGTCTCGGAGGACACTAGTGC 

ACTGCGAGCT 

Figure 3 - continued

SEQ ID NO. 91 (VN4, 7c)

CTCGACAGTCACCGAGCGCGACATCCRCACCGAGCACGACATCTACCAATGCTGCCAACTTGACCC
GGTGGCACGCAAGGCTATTACATCTCTGACTGAGCGGCTGTACTGCGGWGGGCCCATGATGAACTC
CCGTGGTCAATCATGTGGATACCGTAGGTGCCGAGCCAGTGGCGTGCTCACCACGAGCTTGGGCAA
TACCCTAACATGCTATTTGAAAGCACAAGCAGCGTGTAGGGCAGCAAAGCTCAAAAACTATGACAT
GTTAGTCTGCGGAGACGATCTAGTCGTTATCGCGGAGAGTGGAGGAGTCTCTGAGGATGTTGACGC
CCTGCGAGCA 

SEQ ID NO. 93 (VN12,7d) 

CTCCTCCGTCACGGAGCGTGACATCCGCACTGAACACGACATCTATCAGTGCTGCCAATTAGATCC
GGTAGCACGGAAAGCCATTACATCTCTTACTGAGCGGCTGTACTGCGGCGGCCCCATGTACAACTC 
TCGAGGTCAGTCATGTGGGTACCGCAGGTGCCGGGCTAGTGGTGTCTTCACCACAAGCTTGGGCAA 
CACCATGACATGCTACCTGAAGGCTCAGGCGGCTTGTAGGGCAGCRAAGCTCAAAAACTTTGACAT
GTTGGTCTGCGGAGACGACCTAGTCGTTATTGCTGAGAGCGGAGGAGTCCCTGAGGATGCCGGGGC 
CCTGCGAGTC 

SEQ ID NO. 95 (FR1, 9a)

ATCCACAGTCACGGGGCGCGACATACGCACAGAACNAGACATTTACCTGTCCTGCCAGCTCGACCC 
AGAGGCCCGGAAAGCCATAAAGTCTCTCACTGAGAGGCTCTATGTCGGGGGCCCTATGTACAACTC 
AAAGGGCCAACTCTGTGGTCAACGCCGATGCCGAGCAAGCGGAGTACTCCCCACAAGCATGGGTAA 
CACCATCACATGCTTCCTGAAGGCAACCGCCGCTTGCCGAGCAGCCGGCTTTACAGATTATGACAT
GTTGGTCTGCGGAGACGATTTGGTTGTCGTAACTGAGAGTGCTGGAGTCAACGAGGATATCGCTAA 
CCTGCGAGCC 



SEQ ID NO. 97 (NE98, 10a)

CTCCACTGTCACTGAGCAGGACATCAGGGTAGAACTTTCCATCTTTCAGGCCTGTGACCTCAAGGA 
CGAGGCTAGGAGGGTGATAACTTCACTCACGGAGCGGCTTTACTGTGGTGGTCCTATGTTCAACAG 
CAAGGGACAACACTGCGGTTACCGCCGCTGCCGTGCTAGTGGGGTGCTACCCACCAGCTTCGGGAA 
CACAATCACCTGTTACATCAAAGCAAAGGCAGCTACCAAAGCTGCCGGAATTAAAAATCCATCATT
CCTTGTCTGCGGAGATGACTTGGTCGTGATTGCTGAGAGTGCAGGGATCGATGAGGACAAGAGCGC
CTTGAGAGCT 

SEQ ID NO. 99 (FR14, 11a)

CTCTACCGTCACAGAGAGGGACATACGGACAGAAGAATCCATCTATCTGTCTTGTCAATTGCCTGA 
AGAGGCCCGGAAAGCCATTAAATCGCTGACAGAGAGACTATACGTGGGCGGCCCGATGGAAAACAG
CAAGGGCCAGGCTTGCGGATATAGGCGTTGCCGCGCAAGCGGGGTATTCACCACAAGCTTGGGGAA 
CACCATGACTTGTTACATCAAAGCTAAAGCGGCTTGTAAAGCCGCTGGCATTGTAGACCCGGTGAT 
GCTCGTGTGCGGTGACGACCTAGTGGTCATCTCAGAAAGCAAGGGGGTGGAGGAGGACCAGCGGGA 
CCTACGAGTC 

SEQ ID NO. 101 (FR15, 11a)

CTCCACTGTCACTGAGAGAGACATACGGACAGAAGAATCCATCTAYYTGGCTTGTCAATTGCCCGA 
AGAGGCCCGGAAGGCCATTAAATCACTGACAGAGAGACTATACGTGGGCGGCCCGATGGAAAACAG
CAAAGGCCAGGCCTGCGGATATAGGCGTTGCCGCGCAAGCGGGGTATTCACCACAAGCTTGGGGAA 
CACCATGACTTGTTACATCAAGGCCAARGCAGCTTGTAAAGCYGCTGGCATTGTTGACCCGGTGAT 
GCTCGTGTGCGGCGACGACCTAGTGGTCATCTCAGAGAGCAAGGGGGTAGAGGAGGACCAGCGAGA 
CCTAC 

Figure 3 - continued

SEQ ID NO. 103 (FR19, 11a)

CGTACAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTACACC 
GGAATTGCCGGGAAGACTGGGTCCTTTCTTGGATTAACCCACTCTATGCCCGGAGATTTGGGCGTG 
CCCCCGCAAGACTGCTAGCCGAGTAGCGTTGGGTTGCGAAAGGCCTTGTGGTACTGCCTGATAGGG 
TGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAG 
ACAAACCAAAAGAAACACCAACCGCCGCCCACAGGACGTTAAGTTCCCGGGCGGTGGCCAGATCGT 
TGGCGGGGTGTACTTGTTGCCGCGCAGGGGCCCCAGAGTGGGTGTGCGCGCGACGAGAAAGACCTC 
GGAGCGGTCCCAGCCGCGTGGGAGGCGCCAACCTATCCCCAAGGTTAGGCGCACCACCGGCCGTT 

SEQ ID NO. 105 (FR19, 11a)

CTCTACTGTCACAGAGAGGGATATACGAACAGAGGAATCCATYTATCTGGCTTGTCAATTGCCCGA 
AGAGGCCCGGAAGGCCATCAAATCACTGACAGAGAGACTATACGTGGGCGGCCCGATGGAAAACAG 
CAAGGGCCAGGCCTGCGGATACAGGCGTTGCCGCGCAAGCGGGGTATTCACCACAAGCTTGGGGAA
CACCATGACTTGTTACATCAAAGCCAAGGCGGCTTGTAAAGCCGCTGGCATTGTTGACCCAGTGAT 
GCTCGTGTGCGGCGACGACCTAGTGGTCATCTCAGAAAGCAAGGGGGTGGAGGAGGACCAACGAGA 
CCTACGANTC 

SEQ ID NO. 2 (BNL1, 1d)

MSTNPKPQRKTKRNTNRRPXXXXXPGGGQIVGGVYLLPRRGPRXGVRATRKTSERSQPRGRRQPIP 
KAXRXEGRSWAQPGYPWPLYGNEGCGWAXWLLSPRGSRPNWGP

SEQ ID NO. 4 (BNL1, 1d)

DGVNYATGNLPGCSFSIFLLALLSCLTVPXTAHEVRNASGVYHVTNDCSNSSIIYEMDGMIMHYPG
CVPCVREDNHLRCWMALTPTIAVKXASVPTXAIRRHVDLLVGXXTFCSAMYVXDLCGSVFLAGQLF 
TFSPRMHHTTQECNCSI 

SEQ ID NO. 6 (BNL2, 1d)

MSTNPKPQRKTKRNTNRRPQDVKXPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRDRRQPIP 
KARQSDGXXWAQPGHPWPLYGNEGCGWAGWLLSPRGSRPSWGP 

SEQ ID NO. 8 (BNL2, 1d)

DGVNYATGNLPGCSFSIFLLAFLSCLTVPTTAHEVRNASGVYHLTNDCSNSSIIYEMSGMILHAPG
CVPCVRENNSSRCWJdXLTPTLAVKDANVPTAAIRRHVDLLVGTAAFRSAMYVGDLCGSVFLVGQLF 
TFSPRLYHTTQECNCSI 

SEQ ID NO. 10 (CAM1078, 1e)

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYVLPRRGPRLGVRAARKTSERSQPRGRRQPIP 
KERRPEGR 

SEQ ID NO. 12 (FR2, 1f)

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRGRRQPIP
KARRPEGRSWAQPGYPWPLYANEGCGWAGWLLSPRGSRPSWGPNDPRRRSRNLGKVIDTLTCGFAD
LMGYIPLVGAPLGGASRTLXHGVRVLXGGVXXXXXNLXGCSXXIFLLXLLSCLTVPTSAYEVHSTT
DGYHVTNDCSNGSIVYEAKDIILHTPGXVPCIREGNISRCWVPLTPTLAARIANAPIDEVRRHVDL
LVGAAVFCSAMYIGDLCGGVFLVGQLFTFTSRRHWT
VQDCNCSIYSGHITGHXXX

SEQ ID NO. 14 (BNL3, 2e)

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRGRRQPIP 
KDRXATGRSWGRPGYPWPLYGNEGLGWAGWLLSPRGSRPSWG 

SEQ ID NO. 16 (BNL3, 2e) 

TCXXADLMGYXPWGAPVGGXARALAXGVRVLEDGINYXTGNLPGCSFSIFXLALLSCVTVPVSXV 
EVKNTSQAYMATNDCSNNSIVWQLXDAVLHVPGCVPCENSSGRFHCWIPISPNIAVSKPGALTKGL 
RARIDAWMSATLCSALYVGDVCGAVMIAAQAFIVAPKRHYFVQECNCSIYPGHITGHRMA

Figure 3 - continued

SEQ ID NO. 18 (FR4, 2f) 

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRAPRKTSERSQPRGRRQPIP 
KDRRATGKSWGRPGYPWPLYGNEGLGWAGWLLSPRGSRPSWGPNDPRHRSRNLGKVIDTLTCGFXD 
LMGYIPWGAPVGGVARALAHGVRVLEDGINYATGNLPGCSFSIFLLALLSCITVPVSAIQVKNNS 
HFYMATNDCANDSIVWQLRDAVLHVPGCVPCERSGNRTFCWTAVSPNVAVSRPGALTRGLRAHIDT 
IVMSATLCSALYIGDLCGAVMIAAQVAWSPQYHTFVQECNCSIYPGHITGHRMX

SEQ ID NO. 20 (BNL4, 2g)

DGWYATGNLPGCSFSIFLLALLSCVTVPVSAVQVKNTSTMYMATNDCSNNSIIWQMQGAVLHVPG
CVPCELQGNKSRCWIPVTPNVAVNQPGALTRGLRTHIDTIVMVATLCSALYIGDVCGAVMIAAQW
IVSPQHHNFSQDCNCSI

SEQ ID NO. 22 (BNL5, 2h) 

MSTNPKPQRKTKRNTNRRPQDVKFPGGGRSLAEYTCARRGKLRRSSMG 

SEQ ID NO. 24 (BNL5, 2h)

DGINYATGNLPGCSFSIFLLALLSCLTVPASAVQVKNTSHSYMVTNDCSNSSIVWQLKDAVLHVPG 
CVPCERHQNQSRCWIPVTPNVAVSQPGALTRGLRTHIDTIVASATVCSALYVGDFCGAVMLVSQFF 
MISPQHHIFVQDCNCSI

SEQ ID NO. 26 (BNL6, 2i) 

DGINYATGNLPGCSFSIFLLALLSCITVPVSAVQVANRSGSYMVTNDCSNSSIVWQLEEAVLHVPG
CVPCEWKDNTSRCWIPVTPNIAVSQPGAXTKGLRTHIDIIVASATFCSALYV

SEQ ID NO. 28 (BNL7, 4k)

MSTNPKPQRKTKRNTNRRPMDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRGRRQPIP 
KARRSEGRSWAQPGYPWPLYGNEGCGWAXWLLSPRGSRPSWGPNDPRRRSR 

SEQ ID NO. 30 (BNL7, 4k) 

DGINFATGNLPGCSFSIFLLALLSCLTVPASAINYRNVSGIYYVTNDCPNSSIVYEADHHILHLPG 
CVPCVREGNQSRCWVALTPTVAAPYIGAPLESLRSHVDLMVGAATVCSALYIGDXCXGLFLVGQMF
SFRPRRHWTTQDCNCSI

SEQ ID NO. 32 (BNL8, 4k) 

DGINYATGNLPGCSFSIFLLALLSCLTVPASAINYRNTSGIYHVTNDCPNSSIVYEADHHILHLPG
CVPCVRTGNQSRCWVALTPTVAAPYIGAPLESLRSHVDLMVGAATVCSALYIGDLCGGLFLVGQMF
SFRPRRHWTAQDCNCSI

SEQ ID NO. 34 (BNL9, 4k) 

DGINYATGNLPGCSFSIFLLALLSCLTVPASAINYHNTSGIYHITNDCPNSSIVYEADHHILHLPG 
CVPCVRVGNQSSCWVALTPTIAAPYIGAPLESLRSHVDLMVGAATVCSALYIGDLCGGAFLVGQMF
SFRPRRHWTTQDCNCSI

SEQ ID NO. 36 (BNL10, 4k) 

DGINYATGNIPGCXFSIFLXALLSCLTVPASATNYRNVSGIYHVTNDCPNSSIVYEADHHILALPG
CVPCVRVGNQSRCWVALTPTVAAPYTAAPLESLRSHVDLMVGAATVCSALYIGXLCGGLFLVGQMF
SXQPRRHWTTQDCNCSI

SEQ ID NO. 38 (BNL11, 4k) 

DGINYATGXLPGCSFSIFLLALLSCLTVPASATNYRNVSGIYHVTNDCPNSSIVFEADHHILHLPG
CVPCVKEGNHSRCWVALTPTVAAPYIGAPLESLRSHVDVMVGAATVCSALYIGDLCGGLFLVGQMF 
SFRPRRHWTTQECNCSI 

SEQ ID NO. 40 (BNL12, 4l)

DGINYATGNLPGCSFSIFILALLSCLTVPASAQHYRNVSGIYHVTNDCPNSSIVYESDHHILHLPG 
CVPCVKTGNTSRCWVALTPTVAAPILSAPLMSVRRHVDLMVGAATLSSALYVGDLCGGAFLVGQMF
TFQPRRHWTVQDCNCSI

Figure 3 - continued

SEQ ID NO. 46 (VN13, 7a)

MSTLPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRGRRQPIP 
KVRHQTGRTWAQPGYPWPLYGNEGCGWAGWLLSPXGSRPNWGPNDPRXRSRNLGKVIDTLTXXFAD 
LIEYI 



SEQ ID NO. 44 (VN4, 7c) 

MSTLPKPQRKTKRNTIRRPQDVKFPGGGQIVGGVTLLPRRGPRLGVRATRKTSERSQPRGRRQPIP 
KVRHQTGRTWAQPGYPWPLYGNEGCGWAGWLLSPRGSRPNWGPNDPRRRSRNLGKVIDTLTCGFAD 
LMGYIPWGAPXGGVAXALAHGVXXIEDXVNYATXNLPXXSXSIXLLALLSCLTTPASAAHYTNKS 
GLYHLTNDCPNSSIVYEAETLILHLPGCVPCVKXXNQSRCWQASPTLAVPNASTPVTGFRKHVDI 
MVGAAAFCSAMYVGDLCGGLFLVGQLFTLRPRMHQWQECNCSIYTGHITGHRMA

SEQ ID NO. 48 (VN12, 7d) 

MSTLPKPQRKTKRNTNRRPMDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQARGRRQPIP 
KVRQNQGRTWAQPGYPWPLYGNEGCGWAGWLLSPRGSRPDWXPNDPRXRSRNLGKVIDTLTCGFAD 
LMEYIPVVGAPLGGVAAELXHGVRAIEDGINYATGNLPGCSFSIFXLALLSCLTTPASALNYANKS
GLYHLTNDCPNSSIVYEANGMILHLPGCVPCVKTGNLTKCWLSASPTLAVQNASVSIRGVREHVDL 
LVGAAAFCSAMYVGDLCGGLFLVGQLFTFRPRMYEIAQDCNCSIYAGHITGHRMA

SEQ ID NO. 42 (FR1, 9a) 

MSTLPKPQRKTKRNTNRRPMDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRGRRQPIP 
KVRQPTGRSWGQPGYPWPLYGNEGCGWAGWLLSPRGSRPNWGPNDPRRRSRNLGKVIDTLTXXLAD 
LMGYIPVLGGPLGGVAAALAHGVRAIEDGVNYATGNLPGCSFSIFLLALLSCLTTPASAIQVKNAS 
GIYHLTNDCSNNSIVFEAETMILHLPGCVPCIKAGNESRCP7LPVSPTLAVPNSSVPIHGFRRHVDL 
LVGAAAFCSAMYIGDLCGSIILVGQLFTFRPKYHQVTQDCNCSXNXGHVTGHRMA 

SEQ ID NO. 50 (NE98, 10a) 

MSTLPKPQRKTKRNTNXRPQDVKFPGGGQIVGGVYVLPRRGPQLGVRAVRKTSERSQPRSRRQPIP 
RARRTEGRSWAQPGYPWPLYGNEGCGWAGWLLSPRGSRPSWGPNDPRRR 

SEQ ID NO. 52 (NE98, 10a) 

DGINFATGNLPGCSFSIFLLALFSCLLTPTAGLEYRNASGLYMVTNDCSNGSIVYEAGDIILHLPG 
CVPCVRSGNTSRCWIPVSXTVAVKSPCAATASLRTHVDMMVXAATLCSALYVGDLCGALFLXGQGF 
SWRHRQHWTVQDCNCSI

SEQ ID NO. 54 (BNL1, 1d)

STVTENDIRVEESIYQCCDLAPEARKAIKSLTERLYIGGXLTNSKGQNCGYRRCRASGVLTTSCGN 
TLTCYLKARAACRAAKLRDCTMLVCGDDLWICESAGVEEDAANLRA 

SEQ ID NO. 56 (BNL2, 1d)

STVTENDIRTEXSIYQCCDLAXEARKAIKSLTERLYVGGPLTNSKGQNCGYRRCRASGVLTTSCGN 
TLTCYLKARAACRAAKLQDCTMLVCGDDLWICESAGVEEDAANLRV 

SEQ ID NO. 58 (FR17, 1d)

STVTENDIRVEESIYQCCDLAPEARKAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGN 
TLTCYLKARAACRAAKLQDCTMLVCGDDLWICESXGVEEDAANLRV

SEQ ID NO. 60 (CAM1078, 1e)

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYVLPRRGPRLGVRAARKTSERSQPRGRRQPIP 
KERRPEGRSWAQPGYPWPLYGNEGCGWAGXLLSPRGSRPSWGPTDPRRRSRNLGKVIDTLTCXFAD 
LMGYIP 

SEQ ID NO. 62 (CAM1078, 1e)

STVTEADIRTEESIYQCCDLHPEARVAIKSLTERLYVGGPLTNSKGENCGYRRCRASGVLTTSCGN 
TLTCYIKALAACRAAKLQDCTMLVCGDDLWICESVGTQEDAASLRA 

SEQ ID NO. 64 (FR2, 1f)

STVTESDIRTEESIYQCCDLDPEARKAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGN 
TLTCYIKARAACRAAKLQDCSMLVCGDDLWICEIEGXXEDPSXXXX 

SEQ ID NO. 66 (FR16, 1g)

MSTNPKPQRKTKRNINRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATRKTSERSQPRGRRQPIP 
KARRSEGRSWAQPGYPWPLYGNEGMGWAGWLLSPHGSRPSWGPSDPRRRSRNLGKVIDTLTCGFAD 
LMGYIPLVGAPLGGVARALAQGFRDL

SEQ ID NO. 68 (FR16, 1g)

XXVTESDIRVEXSIYQCCDLAPEARVAIKSLTERLYVGGPLTNSKGQNCGYRRCRASGVLTTSCGN 
TLTCYLKAAAACRAAKLRECTMLVCGDDLWICESAGVQEDAASXXX 

SEQ ID NO. 70 (BNL3, 2e)

STVTERDIXTEESIYQACSLPEQARTAIHSLTERLYVGGPMMNSKGQSCGYRHCRASGVLTTSMGN 
TITCYIKALAACKAAGIVAPTMLVCGDDLWISESQGVEEDDRNLXX

SEQ ID NO. 72 (FR4, 2f) 

STVTERDIRTEESIYLACSLPEQARTAIHSLTERLYVGGPMMNSKGQSCGYRRCRASGVLTTSMGN 
TITCYVKALAACKAAGIVAPTMLVCGDDLWISESQGAEEDERNLRV

SEQ ID NO. 74 (BNL5, 2h) 

STVAERDIRTEESIYIACSLPEQARTAIHSLTERLWGGPMMNSKGQSCGYRRCRASGVLTTSMGN 
TITCYVKAIAACKAAGIVAPTMLVCGDDLVIISESQGTEEDERNLRV

SEQ ID NO. 76 (FR13, 2k)

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLXCRXPRXXXCATXKTXEQSQPRGRRQPIP 
KDRXTTGKSWGRPGYPWPLYGNEGLGWAGWLLSPRGSRPSWGPTDPRHRSRNLGKVIDTLTXGFXD 
LMGYIPVVGAPVXGVARALAHGVRVLEDGINYETGNLPGCSFSISLLALLSITXPVSAVEIKNTXN
TYMVTNDCSNXSITWQLXXAVLHVPGCVPCEREGNSSRCWIPVTPXVXVSRPGALTEGLRSHIDTI 
VASATFCSALYIGDVCGAIMIAAQWIVSPEHHHFVQDCNCSIYPGHITGPRMX

SEQ ID NO. 78 (FR13,2k) 

STVTERDIRVEESVYLSCSLPEEARAAIHSLTERLYVGGPMQNSKGQSCGYRRCRASGVLTTSMGN 
TLTCYLKAQAACRAAGIVAPTMLVCGDDLWISESQGTERDENNLRP 




WO 96/13590 PCT/EP95/04155 


Figure 3 - continued 

SEQ ID NO. 80 (FR18, 2l)

STVTERDIRNEESIFLACSLPEEARTVIHSLTERLYIGGPMMNSKGQSCGYRRCRASGVFTTSMGN 
TITCYVKAMAACRAAGIDAPTMLVCGDDLWISESQGTEEDERNLRV 

SEQ ID NO. 82 (PAK64,3g) 

STVTEQDIRVEEEIYQCCDLEPEARRAIKSLTERLYVGGPMFNSKGLKCGYRRCRASGVLPTSYGN 
TITCYIKARAAARAAGLQDPSFLVCGDDLVWAESCXVDEEDRAALR 

SEQ ID NO. 84 (BNL8, 4k) 

STVTEKDIRPEEEVYQCCDLEPEARKVITALTERLYVGGPMHNSKGDLCGYRRCRASGVYTTSFGN 
TLTCYLKASAAIRAAGLRDCTMLVCGDDLWIAESDGVEEDNRALXA

SEQ ID NO. 86 (BNL12, 4l)

STVTEKDIRVEEEIYQCCDLXPEARKAISALTEXLYLGGPMYNSKGELCGYRRCRASGVYTTSFGN 
TVTCYLKATAATRAAGLKDCTMLVCGDDLWIAESEGVEEDSQPLRA 

SEQ ID NO. 88 (EG81, 4m) 

STVTERDIRVEEEVYQCCDLEPEARKAISALTERLYVGGPMFNSKGDLCGYRRCRASGVYTTSFGN 
TLTCYLKATAATRAAGLKDCTMLVCGDDLWIAESDGVDEDRRALQA

SEQ ID NO. 90 (VN13,7a) 

STVTERDVQTEHDIYQCCKLEPAARTAITSLTDRLYXGGPMXNSKGQACGYRRCRASGVLTTILAN 
TLTCYLKAQAACRAAGLKDFDMLVCGDDLWISESLGVSEDTSALRA

SEQ ID NO. 92 (VN4,7c) 

STVTERDIXTEHDIYQCCQLDPVARKAITSLTERLYCXGPMMNSRGQSCGYRRCRASGVLTTSLGN 
TLTCYLKAQAACRAAKLKNYDMLVCGDDLWIAESGGVSEDVDALRA

SEQ ID NO. 94 (VN12,7d) 

SSVTERDIRTEHDIYQCCQLDPVARKAITSLTERLYCGGPMYNSRGQSCGYRRCRASGVFTTSLGN 
TMTCYLKAQAACRAXKLKNFDMLVCGDDLWIAESGGVPEDAGALRV 

SEQ ID NO. 96 (FR1, 9a)

STVTGRDIRTEXDIYLSCQLDPEARKAIKSLTERLYVGGPMYNSKGQLCGQRRCRASGVLPTSMGN 
TITCFLKATAACRAAGFTDYDMLVCGDDLVWTESAGVNEDIANLRA

SEQ ID NO. 98 (NE98,10a) 

STVTEQDIRVELSIFQACDLKDEARRVITSLTERLYCGGPMFNSKGQHCGYRRCRASGVLPTSFGN 
TITCYIKAKAATKAAGIKNPSFLVCGDDLWIAESAGIDEDKSALRA

SEQ ID NO. 100 (FR14, 11a)

STVTERDIRTEESIYLSCQLPEEARKAIKSLTERLYVGGPMENSKGQACGYRRCRASGVFTTSLGN 
TMTCYIKAKAACKAAGIVDPVMLVCGDDLWISESKGVEEDQRDLRV

SEQ ID NO. 102 (FR15, 11a)

STVTERDIRTEESIXXACQLPEEARKAIKSLTERLYVGGPMENSKGQACGYRRCRASGVFTTSLGN 
TMTCYIKAXAACKXAGIVDPVMLVCGDDLWISESKGVEEDQRDLXX 

SEQ ID NO. 104 (FR19, 11a)

MSTNPKPQRQTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRVGVRATRKTSERSQPRGRRQPIP 

KVRRTTGR

SEQ ID NO. 106 (FR19, 11a)

STVTERDIRTEESXYLACQLPEEARKAIKSLTERLYVGGPMENSKGQACGYRRCRASGVFTTSLGN 
TMTCYIKAKAACKAAGIVDPVMLVCGDDLWISESKGVEEDQRDLRX

DECLARATION



As below named inventors, we hereby declare that: 

Our residence, post office address and citizenship are as stated below next to our names. 

The below named inventors are the original, first and joint inventors of the subject matter 
which is claimed and for which a patent is sought on the invention entitled NEW SEQUENCES 
OF HEPATITIS C VIRUS GENOTYPES AND THEIR USE AS PROPHYLACTIC, 
THERAPEUTIC AND DIAGNOSTIC AGENTS, the specification of which was filed as PCT 
International Application No. PCT/EP95/04155 on October 23, 1995 and was not amended 
under PCT Article 19. 

We hereby state that we have reviewed and understand the contents of the above 
identified specification, including the claims. 

We acknowledge the duty to disclose to the Patent and Trademark Office all information 
known to us to be material to patentability of the subject matter claimed in this application, as 
"materiality" is defined in Title 37, Code of Federal Regulations, § 1.56.

We hereby claim foreign priority benefits under Title 35, United States Code, § 119(a)-(d)
of any foreign application(s) for patent listed below and have also identified below any
foreign application for patent or inventor's certificate having a filing date before that of the 
application on which priority is claimed. 

PRIOR FOREIGN APPLICATION(S)

Number        Country    Date Filed          Priority Claimed
95870076.7    Europe     28 June 1995        Yes
94870166.9    Europe     21 October 1994     Yes

We hereby claim the benefit under Title 35, United States Code, § 120 of any United 
States application(s), or § 365(c) of any PCT International application designating the United 
States of America, listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT International application in the 
manner provided by the first paragraph of Title 35, United States Code, § 112, we acknowledge
the duty to disclose all information known to me to be material to patentability of the subject 
matter claimed in this application, as "materiality" is defined in Title 37, Code of Federal 
Regulations, § 1.56, which become available between the filing date of the prior application and 
the national or PCT international filing date of this application. 

International Application No.    International Filing Date
PCT/EP95/04155                   October 23, 1995

We hereby direct that all correspondence and telephone calls be addressed to: 

Patricia A. Kammerer
Arnold, White & Durkee
P.O. Box 4433

(713) 787-1438
attorneys for the prospective assignee of this application.



WE HEREBY DECLARE THAT ALL STATEMENTS MADE OF OUR OWN 
KNOWLEDGE ARE TRUE AND THAT ALL STATEMENTS MADE ON INFORMATION 
AND BELIEF ARE BELIEVED TO BE TRUE; AND FURTHER THAT THESE 
STATEMENTS WERE MADE WITH THE KNOWLEDGE THAT WILLFUL FALSE 
STATEMENTS AND THE LIKE SO MADE ARE PUNISHABLE BY FINE OR 
IMPRISONMENT, OR BOTH, UNDER SECTION 1001 OF TITLE 18 OF THE UNITED 
STATES CODE AND THAT SUCH WILLFUL FALSE STATEMENTS MAY JEOPARDIZE 
THE VALIDITY OF THE APPLICATION OR ANY PATENT ISSUED THEREON. 



Inventor's Full Name: MAERTENS GEERT

Inventor's Signature:

Date: [illegible]
Country of Citizenship: Belgium

Residence Address:
Zilversparrenstraat 64
B-8310 Brugge
BELGIUM

Post Office Address, if different from above: same as above




Inventor's Full Name: STUYVER LIEVEN

Inventor's Signature:

Date: [illegible]
Country of Citizenship: Belgium

Residence Address:
Holestraat 8
B21Q0 Mol -6-9552
BELGIUM

Post Office Address, if different from above: same as above

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE



Int'l App. No. PCT/EP95/04155
Group Art Unit: Unknown 
Examiner: Unknown 
Atty. Docket No.: INNS004/KAM 



In re Application of : GEERT MAERTENS 

and LIEVEN STUYVER 

Serial No. : Unknown 

I.A. filing date: October 23, 1995

For: NEW SEQUENCES OF HEPATITIS C 

VIRUS GENOTYPES AND THEIR USE AS 
PROPHYLACTIC, THERAPEUTIC AND 
DIAGNOSTIC AGENTS 

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 CFR 1.9(f) and 1.27(c)) - SMALL BUSINESS CONCERN 

I hereby declare that I am an official of the small business concern empowered to act on behalf of the concern 
identified below: 



NAME OF CONCERN: INNOGENETICS N.V.
ADDRESS OF CONCERN: Industriepark Zwijnaarde 7, Box 4
B-9052 Gent, BELGIUM



I hereby declare that the above identified small business concern qualifies as a small business concern as 
defined in 13 CFR 121.3-18, and reproduced in 37 CFR 1.9(d), for purposes of paying reduced fees under Section 
41(a) and (b) of Title 35, United States Code, in that the number of employees of the concern, including those of its 
affiliates, does not exceed 500 persons. For purposes of this statement, (1) the number of employees of the 
business concern is the average over the previous fiscal year of the concern of the persons employed on a
full-time, part-time or temporary basis during each of the pay periods of the fiscal year, and (2) concerns are affiliates
of each other when either, directly or indirectly, one concern controls or has the power to control the other, or a 
third party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with the small 
business concern identified above with regard to the invention entitled NEW SEQUENCES OF HEPATITIS C 
VIRUS GENOTYPES AND THEIR USE AS PROPHYLACTIC, THERAPEUTIC AND DIAGNOSTIC 
AGENTS by inventors described in the specification filed as International Application No. PCT/EP95/04155. 

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in 
loss of entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any 
maintenance fee due after the date on which status as a small entity is no longer appropriate. (37 CFR 1.28(b)) 

I HEREBY DECLARE THAT ALL STATEMENTS MADE HEREIN OF MY OWN KNOWLEDGE ARE 
TRUE AND THAT ALL STATEMENTS MADE ON INFORMATION AND BELIEF ARE BELIEVED TO BE 
TRUE; AND FURTHER THAT THESE STATEMENTS WERE MADE WITH THE KNOWLEDGE THAT 
WILLFUL FALSE STATEMENTS AND THE LIKE SO MADE ARE PUNISHABLE BY FINE OR 
IMPRISONMENT, OR BOTH, UNDER SECTION 1001 OF TITLE 18 OF THE UNITED STATES CODE,
AND THAT SUCH WILLFUL FALSE STATEMENTS MAY JEOPARDIZE THE VALIDITY OF THE 
APPLICATION, ANY PATENT ISSUING THEREON, OR ANY PATENT TO WHICH THIS VERIFIED 
STATEMENT IS DIRECTED. 



Date: [illegible]



SIGNATURE: 
By: 




Dr. Hugo Van Heuverswyn
Managing Director, INNOGENETICS N.V.
Colmanstraat 62
B-9270 Kalken, Belgium



